CELL-BASED ASSAY

Technical Field of the Invention

This invention relates to materials and methods for the identification of inhibitors of certain
molecular interactions.

Background of the Invention 

Cellular signal transduction, i.e., the series of events leading from extracellular events to 
intracellular sequelae, is an aspect of cellular function in both normal and disease states. Numerous
proteins that function as signal transducing molecules have been identified, including receptor and
nonreceptor tyrosine kinases, phosphatases and other molecules with enzymatic or regulatory 
activities. These molecules generally demonstrate the capacity to associate specifically with other 
proteins to form a signaling complex that can alter cell activity. 

Signaling proteins often contain domains of conserved sequence which serve as non-catalytic
modules that direct protein-protein interactions during signal transduction. One such domain is the
Src homology domain 2 (SH2), which is found in various combinations and locations in different
proteins. For example, some members of the Src-family of tyrosine kinases, e.g., Abl, GRB2 and
PI3K, each contain one SH2 domain. Other tyrosine kinases, such as ZAP and Syk, contain two SH2
domains. The presence of multiple SH2 domains within a protein increases the variety of potential
protein-protein interactions.

SH2 domains direct the association of specific proteins or protein domains by binding
selectively and with specificity to protein sequences or motifs containing phosphotyrosine. For
example, upon ligand binding, the PDGF beta-receptor dimerizes and autophosphorylates multiple
tyrosine residues. This phosphorylation of tyrosine-containing protein motifs within the receptor
triggers its physical association with SH2-containing proteins such as c-src, PLC-gamma, PI3K
and ras-GAP, forming a signaling complex. Other examples include the binding of the tandem
SH2-containing protein Syk to tyrosine-phosphorylated motifs on the beta or gamma subunits of the
IgE receptor, and the binding of the tandem SH2-containing protein ZAP-70 with
tyrosine-phosphorylated motifs on subunits of the T cell receptor. Other protein receptors for
tyrosine-phosphorylated protein domains are known and include the so-called phosphotyrosine binding
domains ("PTBs") or phosphotyrosine interaction domains ("PIDs"). Many signaling pathways
which play critical roles in disease processes are mediated by the binding of a
phosphotyrosine-containing protein or protein domain with an SH2 or other protein receptor for a
tyrosine-phosphorylated domain.

Pharmaceutical agents which interfere with the formation or stability of such signaling
complexes may be used for precise intervention in these complex biological processes in order to
treat or prevent the diseases or pathological effects mediated by such signaling. However, one
common but significant problem when screening for pharmaceutically useful compounds that
disrupt specific cellular events is deciphering their mechanism and specificity of action. Typically
one searches for inhibition of a signaling cascade by measuring the effect of test compounds on an
event several, if not many, steps downstream of a protein-protein interaction of particular
interest. To the extent that a compound may exert its effects on any number of intermediate steps,
the results of such experiments can be difficult to interpret.

A so-called "two-hybrid" interaction assay described by Song and Fields, Nature, 340:245-247
(1989) has been used to detect the interactions of a variety of molecules. See also Fields et
al., US Patent No. 5,283,173 (1 Feb 1994). The two-hybrid assay is based on the observation that
transcription factors contain separable functional modules that direct either DNA binding or 
transcription activation. A DNA binding domain expressed in cells will bind to DNA but not activate 
transcription as it lacks a transcription activation domain. Conversely, a transcription activation 






PCT/US97/02635

WO 97/31113 



domain alone will not affect transcription in the absence of directed and/or intimate interaction
with DNA, as would be provided by a DNA-binding domain. However, if the DNA binding domain
and the transcription activation domain are each expressed as part of separate fusion proteins, and
the fusion proteins are capable of associating, the "two-hybrid" complex so formed represents a
reconstituted transcription factor (see FIG. 1). Such a reconstituted transcription factor is capable
of activating transcription of a reporter gene (e.g., a gene for a conveniently detectable marker
such as beta-galactosidase or alkaline phosphatase (SEAP) or a protein important for cell
survival) located downstream of DNA binding sites recognized by the DNA-binding domain. The
amount of reporter gene expression, i.e., the amount of gene product produced, will reflect the
extent to which the fusion proteins complex with one another. Compounds that block the association
of the fusion proteins will reduce reporter gene expression.

The two-hybrid assay approach has been used to identify presumed SH2-dependent protein-protein
interactions using yeast [Xing, Z. et al., Mol. Biol. Cell, 5:413-421 (1994) and Osborne et al.,
Biotechnology, 13:1474-1478 (1995)]. In those experiments, protein binding pairs, rather than
inhibitors of such binding, were identified. The two-hybrid approach has also been used to detect
the inhibition of certain protein-protein interactions in yeast, but not for inhibition of the
interaction of SH2 domains with phosphorylated ligands [see FEBS Lett. 357:221-226 (1995)]. While
a yeast-based two-hybrid approach may be useful for other purposes, it would not lend itself to
screening for drugs to inhibit phosphopeptide-mediated interactions in mammalian cells, since the
use of yeast cells would limit the researcher, or at least the screen, to the subset of compounds
which are able to penetrate the yeast cell wall. That artificial limitation would incorporate into
the experimental design the risk of missing important new compounds capable of blocking key
signaling interactions in mammalian cells but unable to penetrate the yeast cell wall. There thus
exists a need for improved methods and compositions for identifying inhibitors of
phosphopeptide-mediated protein-protein binding in a wide variety of host cells.

The present invention addresses this need by providing a mechanism-based in vivo assay that
allows the identification of inhibitory compounds that target key molecules involved in specific
signal transduction pathways. The assay uses cells containing recombinant DNA molecules encoding
a pair of fusion proteins. One DNA molecule encodes a transcription activating fusion protein
which comprises one or more transcription activation domains (TADs) and at least one member of a
phosphopeptide binding pair, i.e., a phosphopeptide binding domain or peptide ligand therefor. The
other DNA molecule encodes a DNA-binding fusion protein which comprises one or more DNA binding
domains (DBDs) and the other member of the phosphopeptide binding pair. The cell further contains
a reporter gene which is linked to a DNA sequence to which the DNA-binding fusion protein is
capable of binding. The reporter gene encodes a detectable gene product which is expressed when
transcription of the reporter gene is activated, e.g., following association of the two fusion
proteins to reconstitute an effective transcription factor. The engineered cell also contains a
protein kinase capable of phosphorylating one or more of the tyrosine residues of the peptide
ligand, as is required for two-hybrid formation. The kinase may be endogenous to the cell or may
be the gene product of an introduced, heterologous gene. In one embodiment, one of the fusion
proteins contains a kinase domain and thus provides its own kinase function. One or both of the
fusion proteins may optionally contain a nuclear hormone-binding domain (HBD) permitting
regulated sequestering or inactivation of the fusion protein. In such embodiments, addition of the
hormone renders the fusion protein available for two-hybrid formation. When the cell expresses the
two fusion proteins, and the peptide ligand domain has become tyrosine phosphorylated, the two
fusion proteins are capable of binding to each other via complexation of the phosphorylated
peptide ligand of one fusion protein with the PBD of the other. So complexed, the fusion proteins
are capable of detectably activating the transcription of the reporter gene linked to a DNA
sequence recognized (i.e., bound) by the DNA-binding domain of one of the fusion proteins.
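
By way of illustration only, the conditions under which the reporter gene is expressed may be summarized as a simple logical model. The following Python sketch is not part of the described materials; the class name, field names and function name are hypothetical, and the all-or-nothing treatment is a deliberate simplification of what is in practice a graded level of expression.

```python
from dataclasses import dataclass


@dataclass
class AssayCell:
    """Hypothetical, simplified description of one engineered cell."""
    has_dbd_fusion: bool             # DNA-binding fusion protein is expressed
    has_tad_fusion: bool             # transcription activating fusion protein is expressed
    kinase_active: bool              # tyrosine kinase available (endogenous or introduced)
    hbd_present: bool = False        # a fusion protein carries a hormone-binding domain
    hormone_added: bool = False      # hormone has been supplied to the medium
    inhibitor_present: bool = False  # test composition blocks ligand/PBD binding


def reporter_expressed(cell: AssayCell) -> bool:
    """Return True when the two-hybrid complex can form and activate the reporter."""
    fusions_available = cell.has_dbd_fusion and cell.has_tad_fusion
    # With an HBD, the fusion protein stays sequestered or inactive until hormone is added.
    released = cell.hormone_added or not cell.hbd_present
    # The peptide ligand binds the PBD only after tyrosine phosphorylation.
    ligand_phosphorylated = cell.kinase_active
    return (fusions_available and released and ligand_phosphorylated
            and not cell.inhibitor_present)
```

In this sketch an inhibitor, a missing kinase activity, or a withheld hormone (in hormone-regulated embodiments) each suffices to switch the reporter off, which is why the controls described elsewhere herein are needed to attribute a loss of signal to inhibition of the phosphopeptide binding pair.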
One object of the invention is to provide the above-described DNA molecules. Another is to
introduce the DNA molecules into host cells to provide genetically engineered cells useful for
conducting the assay methods described herein.

Another object of the invention is to provide a method for identifying the presence in a test
composition of an inhibitor of the binding of a tyrosine-phosphorylated ligand with a
phosphopeptide binding domain therefor or of a biological activity mediated by that phosphopeptide
binding pair. This method involves culturing or maintaining the above-described genetically
engineered cells in a medium suitable for cell growth, in the presence and absence of the test
composition, and determining whether production of the detectable reporter gene product was
diminished in the presence of the test composition. In practice, the test composition is added to the
cells, e.g. to the medium in which the cells are cultured, and the culture is incubated under
conditions permitting formation of a complex between the fusion proteins. In embodiments
involving hormone regulation, the hormone may be added before, during or, as may often be
preferable, after addition of the test composition. If binding of the DNA-binding fusion protein to
the transcription activating fusion protein occurs to a lesser extent in the presence of the test
composition than in its absence, i.e., if the presence or increased concentration of the test
composition reduces the concentration of two-hybrid complex, then the test composition is or
contains a phosphopeptide binding pair inhibitor. The presence or absence of two-hybrid complex
may be monitored by measuring the level of expression of the reporter gene.

To recap, an illustrative in vivo assay format relies upon genetically engineered cells capable
of expressing a reporter gene under PBP-mediated transcriptional control. These cells contain a
phosphopeptide binding domain of interest and a corresponding peptide ligand, each in the form of a
distinct fusion protein. Each such fusion protein thus comprises, among other component regions,
at least one PBD or peptide ligand sequence. The two fusion proteins are designed to be capable of
forming a complex with each other through binding of the tyrosine-phosphorylated ligand domain of
one fusion protein to the PBD of the other fusion protein. In the presence of a kinase activity
capable of tyrosine-phosphorylating the ligand domain, as discussed elsewhere, and in
hormone-dependent embodiments, in the presence of the necessary hormone, the cells express the
reporter gene unless an inhibitory substance is present which interferes with the formation or
persistence of the two-hybrid complex required for transcription of the reporter gene. In this
assay, the cells are cultured or maintained in a suitable culture medium to establish a base-line
for expression of the reporter gene. The test composition is added to the culture medium and the
ability of the test composition to inhibit expression of the reporter gene is measured. A series of
experiments with different concentrations of the test composition can be conducted in parallel. If
the level of reporter gene expression is reduced in the presence of the test composition, the test
composition is an inhibitor. If the structure of the inhibitor so identified is not yet known, the
compound may then be isolated from the other assay components and characterized. It may be
re-evaluated, if desired, using engineered cells expressing analogous fusion proteins but bearing
different PBDs and/or peptide ligand domains. This provides a means to confirm the selectivity of
the interaction of the inhibitor with the PBP with which it was identified. If desired, the binding
affinity of the inhibitor for the PBD or peptide ligand domain with which it was identified may be
determined, e.g., through the use of BIAcore® technology. The inhibitor so identified may be
assayed in an in vitro binding assay as described above and may further be evaluated for
pharmacological and/or toxicological activity in various in vitro and/or in vivo assays, as desired.
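
The comparison of reporter expression against the base-line across a series of concentrations may be sketched, for illustration only, as follows. The function names, the optional background correction and the 50% hit threshold are assumptions made for this example and are not prescribed by the assay described herein.

```python
def percent_inhibition(baseline: float, treated: float, background: float = 0.0) -> float:
    """Percent reduction in reporter signal relative to the untreated base-line."""
    window = baseline - background
    if window <= 0:
        raise ValueError("baseline must exceed background")
    return 100.0 * (baseline - treated) / window


def screen_series(baseline, readings, threshold=50.0, background=0.0):
    """Map each test concentration to (percent inhibition, flagged as hit).

    `readings` is a dict of {concentration: reporter signal}; the threshold
    (in percent) is an arbitrary, assumed cut-off.
    """
    results = {}
    for conc, signal in sorted(readings.items()):
        pct = percent_inhibition(baseline, signal, background)
        results[conc] = (pct, pct >= threshold)
    return results
```

A test composition whose percent inhibition rises with concentration behaves as expected of an inhibitor in this format; a flat or erratic profile would suggest that further controls are warranted before any conclusion is drawn.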

A wide variety of test compositions can be assayed in accordance with this invention,
including, e.g., microbial broths, cellular extracts, conditioned media from cell lines or from host
cells transformed with genetic libraries, collections of synthetic compounds, combinatorial
libraries, or other compounds or mixtures from synthetic programs based on conventional
medicinal chemistry approaches or structure-based drug design.

This invention thus provides a means for identifying selective inhibitors of phosphopeptide
binding interactions. As noted at the outset, phosphopeptide binding domains are present in a wide
variety of proteins, including proteins involved in intracellular signal transduction pathways
relevant to a number of important normal and disease processes. Accordingly, inhibitors identified
by this invention may be useful for a variety of purposes. First, they may be useful as
biological reagents in assays as described herein for functional classification of a phosphopeptide
binding domain or phosphopeptide receptor of a particular protein, particularly a newly discovered
protein. Families or classes of such proteins may thus be defined functionally, with respect to
binding.

Inhibitory agents of this invention can also be used to inhibit the occurrence of
biological events resulting from molecular interactions mediated by a phosphopeptide binding
domain. This invention thus provides a method and reagents for inhibiting (totally or partly) the
interaction between a protein containing a phosphopeptide binding domain and a natural ligand
thereto (i.e., a protein which normally binds in a cell to the SH2, PID or other phosphopeptide
binding domain) or a biological activity mediated by such interaction.

Such inhibition can be used to study the biological role of the signaling event and the cell and
molecular biology of the affected signal transduction pathway.

An inhibitor identified by the method of this invention can be formulated into a
pharmaceutical composition containing a pharmaceutically acceptable carrier and/or other
excipient(s) using conventional materials and means. Such a composition can be administered to an
animal, either human or non-human, for prevention or treatment of a disease or condition
resulting from cellular events involving a phosphopeptide/PBD-mediated interaction.
Administration of such composition may be by any conventional route (parenteral, oral,
inhalation, and the like) using appropriate formulations as are well known in this art. The
inhibitor of this invention can be employed in admixture with conventional excipients, i.e.,
pharmaceutically acceptable organic or inorganic carrier substances suitable for parenteral
administration. Inhibitors first identified by these methods, pharmaceutical compositions
containing such inhibitors, and pharmaceutical methods involving administration of such
inhibitors to patients for the treatment or prevention of diseases mediated by the binding
interactions of naturally occurring phosphopeptide binding pairs are therefore important objects
of this invention.

Other aspects and advantages of the present invention are described further in the following
detailed description of the preferred embodiments thereof.

Brief Description of the Drawings 

FIG. 1 depicts various assay configurations of this invention.

FIG. 2 is a bar graph depicting the results of a zeta (ζ) ITAM-tandem ZAP SH2-dependent
two-hybrid formation in mammalian cells. Secreted alkaline phosphatase (SEAP) activity (RU) is
plotted against cells which received the following DNAs, which correspond to the columns in the
graph: (1) carrier DNA, (2) pMSZ (Example 1D), (3) pMSZ and pMAZ22 (Example 1E), (4)
pMS (GAL4 DNA binding domain-vSrc kinase), and (5) pMS and pMAZ22.

FIG. 3 illustrates an estrogen-regulated assay system. The bar graph depicts the effect of the
presence (+) and absence (-) of estrogen on Zeta ITAM-tandem ZAP SH2-dependent
two-hybrid formation in G51L-2 HT1080 cells. SEAP activity is plotted against cells
which received plasmid DNAs in duplicate, i.e., cols. 1 and 2 are carrier DNA; cols. 3 and 4 are
pMSZ; cols. 5 and 6 are pMSZ and pMAZ22; cols. 7 and 8 are pMerSZ (Example 1F); and cols. 9 and
10 are pMerSZ and pMAZ22.

FIG. 4 is a bar graph of a ZAP two-hybrid assay plotting SEAP activity units vs. four stable 
clones A through D, each grown with (dark bar) or without (light bar) inducer (estrogen).

FIG. 5 is a bar graph plotting the time course of SEAP production (RU) in Clone D with
inducible Zeta ITAM-tandem ZAP SH2-dependent two-hybrid activity vs. time following estrogen
addition to media from 0 minutes to 24 hours. The graph also illustrates that parent cells
(unengineered G51L-2 HT1080 cells) do not produce SEAP when cultured in the presence (+E) or
absence (-) of estrogen.

FIG. 6 is a bar graph plotting SEAP activity (RU) vs. specific Beta ITAM-Src SH2-dependent 
two-hybrid formation in mammalian cells transfected with plasmid DNAs, which correspond to the 
columns in the graph: (1) pMSB (Example 1K); (2) pMSB and pMAS2 (Example 1N); or (3)
pMSB and pMAZ22. 

FIG. 7 is a bar graph plotting SEAP activity (RU) vs. estrogen regulation of Beta ITAM-Src 
SH2-dependent two-hybrid formation in mammalian cells. The presence (+) or absence (-) of
estrogen is indicated. The columns represent cells transfected in duplicate with pMerSB
(Example 1L) [Cols. 1 and 2] or with pMerSB and pMAS2 (Example 1N) [Cols. 3 and 4].

FIG. 8 is a bar graph plotting SEAP activity (RU) of a stable mammalian cell line containing 
an inducible GAL4-estrogen receptor l.b.d.-VP16 transcription activator in the presence (+) or
absence (-) of estrogen.

FIG. 9 is a bar graph plotting SEAP activity (RU) vs. estrogen regulation of Beta ITAM-Fyn 
SH2-dependent two-hybrid formation in mammalian cells observed by transient transfection (FIG.
9, panel A, columns 1-4 of the graph) or when using a stable cell line (FIG. 9, panel B, columns 5
and 6). The presence (+) or absence (-) of estrogen is indicated. The columns represent cells
transfected in duplicate with pMerSB (Example 1L) [Cols. 1 and 2] or with pMerSB and pMAF2
(Example 1P) [Cols. 3 and 4]. Columns 5 and 6 represent a stable cell line (prepared as described
in example 2) containing both pMerSB and pMAF2 cultured in the absence (column 5) or presence 
(column 6) of estrogen. 

Detailed Description of the Invention 

The present invention addresses current needs in the art of drug discovery by providing a
mechanism-based, cellular two-hybrid assay for identifying inhibitors of key molecules involved
in certain signal transduction pathways. This assay permits the determination of whether a test or
unknown compound is capable of blocking a specific phosphopeptide-mediated association, and
provides compositions and methods for identifying and functionally characterizing such inhibitors.
This invention permits the identification of inhibitors of the molecular association between an SH2,
PID or other phosphopeptide binding domain of a given protein and a naturally occurring ligand
therefor. The inhibitors so identified are themselves useful in therapeutic and pharmaceutical
compositions and regimens.

Among other features, this invention provides genetically engineered cells, preferably
eucaryotic cells, useful for high throughput identification of compounds which interfere with,
inhibit or otherwise impede interactions mediated by the binding of a tyrosine-phosphorylated
ligand with its phosphopeptide binding protein. The cells of this invention contain and are capable of
expressing a reporter gene and genes encoding two fusion proteins. One of the fusion proteins
contains a phosphopeptide binding domain, the other contains a ligand for that PBD. The two fusion
proteins are capable of binding to each other and, when they do so, of detectably activating the
transcription of the reporter gene.

One or both of the fusion proteins may also contain one or more optional domains or elements,
including a domain capable of regulatably sequestering the fusion protein at a desired cellular
location or compartment or in an inactive conformation.

In the practice of this invention, cells containing the components mentioned above are
cultured under suitable conditions and in a suitable culture medium permitting cell growth and a
detectable level of expression of the reporter gene. A test substance is added to the medium, and the
cells are cultured for an appropriate incubation period, i.e., one permitting an inhibitor to enter
cells, block two-hybrid formation or persistence and effect a detectable change in reporter gene
expression. The level of expression of the reporter gene is then measured. A decrease in the level of
expression of the reporter gene indicates that the test substance is a candidate inhibitor.

The assay may be repeated using a control cell containing the reporter gene and its associated
control elements and further containing and being capable of expressing a gene encoding a
transcription factor fusion protein as a positive control. The transcription factor fusion protein
contains both a DNA binding domain and a transcriptional activating domain permitting it to activate
transcription of the reporter gene. The DBD and TAD are typically the same domains which are
separately present in the respective DNA-binding and transcriptional activating fusion proteins
previously described. Alternatively, the control cell may express, in place of the transcription
factor fusion protein, two alternative two-hybrid fusion proteins which were designed based on a
different protein binding pair than the phosphopeptide binding pair represented by the primary
two-hybrid assay cells as described above. The failure of the test substance to significantly alter
the level of expression of the reporter gene in such controls supports the conclusion that the test
substance is not generally inhibiting the assay or inhibiting the assay read-out via an irrelevant
mechanism. Such controls help confirm the phosphopeptide binding inhibitor status of candidate
inhibitors.
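
The reasoning behind such controls may be sketched as follows, purely for illustration. The function names, the result labels and the two cut-off values are assumptions introduced for this example; in practice the criteria for "significantly alter" would be chosen by the practitioner.

```python
def fraction_remaining(untreated: float, treated: float) -> float:
    """Reporter signal with test substance as a fraction of the untreated signal."""
    if untreated <= 0:
        raise ValueError("untreated signal must be positive")
    return treated / untreated


def classify_candidate(primary, control, hit_cutoff=0.5, control_cutoff=0.8):
    """Classify a test substance from (untreated, treated) reporter readings.

    `primary` comes from the two-hybrid assay cells and `control` from the
    control cells. Both cut-offs are arbitrary, assumed values.
    """
    primary_left = fraction_remaining(*primary)
    control_left = fraction_remaining(*control)
    if primary_left > hit_cutoff:
        return "inactive"            # reporter not substantially reduced in assay cells
    if control_left < control_cutoff:
        return "nonspecific"         # reporter also reduced in control cells
    return "candidate PBP inhibitor"
```

A substance that lowers the read-out in both cell types is more likely to be acting on transcription, translation, cell viability or the reporter itself than on the phosphopeptide binding pair.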



Definitions 



To facilitate understanding of this invention, the following definitions are provided:

A "peptide ligand" as used herein defines a polypeptide motif or fragment thereof which
contains one or more tyrosine residues, and which is capable of binding to a phosphopeptide binding
domain such as an SH2 or PID domain when at least one of the peptide ligand's tyrosine residues is
phosphorylated. A peptide ligand may be an oligopeptide, polypeptide or protein fragment or
portion. (The terms "polypeptide" and "protein" are used interchangeably herein.)

A "phosphopeptide binding domain" (PBD) is an amino acid sequence or polypeptide region,
typically present in certain signaling proteins, which is capable of selective and specific binding to
a characteristic tyrosine-phosphorylated ligand. The binding of a tyrosine-phosphorylated ligand
to a PBD is considered to mediate or direct the association of the proteins and the transduction of the
intracellular signal.

A "phosphopeptide binding pair" (PBP) consists of a PBD and a peptide ligand therefor.

An "SH2 domain" is an example of a PBD. An "SH2 ligand" is an example of a peptide ligand.

A "phosphopeptide binding pair inhibitor" is a compound which can be shown to inhibit the
formation or persistence of binding between a PBD and a tyrosine-phosphorylated peptide ligand
therefor. A phosphopeptide binding pair inhibitor may function by binding to a PBD with
competitive avidity vis-a-vis a tyrosine-phosphorylated peptide ligand for the PBD, binding to a
tyrosine-phosphorylated peptide ligand for the PBD with competitive avidity vis-a-vis the PBD,
or otherwise disrupting the normal binding between a PBD and a tyrosine-phosphorylated peptide
ligand.

A "DNA Binding Domain" (DBD) is a polypeptide or protein domain, e.g. a portion of a 
transcription factor, which binds to a DNA sequence, typically a transcription regulatory DNA 
element linked to a target or reporter gene. 

A "Transcription Activator Domain" (TAD) is a polypeptide or protein domain, e.g. a portion
of a transcription factor, which when in proximity to transcription regulatory DNA elements of a
target gene, activates gene transcription (and is typically linked to a nuclear localization
sequence).

A "reporter gene" is a gene which, upon expression, produces a detectable gene product.

The term "DNA element" is used to refer to a transcription regulatory DNA sequence, typically
upstream of a target or reporter gene, which binds to the DBD and permits transcription in the
presence of an appropriate transcription factor or two-hybrid complex.

A "DNA binding fusion protein" is a fusion protein composed of at least one copy of a DBD fused 
to at least one copy of a member of a PBP. A "Transcription activator fusion protein" is a fusion
protein composed of at least one copy of a TAD fused to at least one copy of the other member of a 
PBP. 

1. The Engineered Cells of the Invention

A. The Host Cells

Any eukaryotic cells which can be transformed with heterologous DNA and which can be grown
or maintained in culture may be used to prepare the novel engineered cells of this invention.
However, mammalian cells are presently preferred. Mammalian cells include, without limitation,
those of mouse, hamster, rat, rabbit, dog, cow or primate, including human, origin. They may be of
a wide variety of tissue types, including without limitation, mast cells, fibroblasts, osteoclasts,
osteoblasts, macrophages, neutrophils, other T cells, epithelial cells, endothelial cells, hepatic
cells, kidney cells, or other cell types and may be primary cells or cell lines.
Yeast cells may also be used as the host eukaryotic cells of this invention.

The host cells of this invention are engineered by the introduction therein of heterologous
DNA(s) encoding and capable of directing the expression of the fusion proteins as well as
heterologous DNA comprising a reporter gene under the transcription control of a DNA element
responsive to the "two-hybrid" complex of the DNA-binding fusion protein with the transcription
activator fusion protein.

The DNA molecule(s) encoding the two fusion proteins, as well as DNA comprising the
reporter gene and its associated DNA element, may be introduced into the host cells as separate
DNAs or as one or more DNAs linked together.

B. Reporter Gene Constructs 

Suitable reporter genes include any gene whose expression can be detected. Many such genes
are known to the art, including, without limitation, those encoding an enzyme which catalyzes a
reaction resulting in a detectable color change, an enzyme that permits continued cell growth or
viability in the presence of an otherwise toxic agent or absence of an otherwise essential nutrient,
a repressor or suppressor of a biological function which is activated or enhanced by the reduced
expression of the "reporter" gene, or simply a gene product which can be conveniently detected.

Examples of reporter genes which may be used in the practice of this invention include,
without limitation, genes encoding enzymes such as luciferase, beta-galactosidase, alkaline
phosphatase and horseradish peroxidase. The proteins encoded by these genes catalyze reactions
generating readily detectable products. Alternatively, the reporter gene may encode a readily
detectable cell surface marker such as Thy1 or a secreted protein such as hGH for which there is a
convenient, and preferably commercially available, assay. One well known reporter gene, which is
exemplified herein, is the gene encoding secreted alkaline phosphatase (SEAP) [Berger, J. et al.,
Gene, 66:1-10 (1988)].

The production of SEAP may be monitored from the culture media without the need to lyse the
cells. This permits convenient detection, using a conventional colorimetric or fluorescent assay, at
different time points during the assays of the invention.

The reporter gene is linked to a DNA element recognized by the DBD and permitting gene
expression in response to two-hybrid formation (i.e., complexation of the first and second fusion
proteins). The DNA element may comprise a synthetic promoter sequence consisting of multiple
DBD binding sites upstream of a TATA box element. The number of such binding sites can be
adjusted to optimize the overall level of reporter gene expression induced by the fusion proteins of
this invention. Examples of such binding sites include GAL4 binding sites [see, e.g., Acheson, "DNA
Tumor Viruses", Cold Spring Harbor, NY (Tooze, J. ed., 1980) pp 151-160; Peden et al., Science,
209:1392-1396 (1980); Liu & Green, Cell, 61:1217-1224 (1990); and Ptashne & Gann,
Nature, 346:329-331 (1990)].

The DNA element of the reporter gene, when introduced into cells, will preferably not be
occupied in the absence of exogenously added DBD of the second fusion protein. The extent of
occupancy of the DNA element may be determined by measuring the level of reporter gene
transcription in the absence and presence of the fusion proteins, the kinase domain or the
transcription factor fusion protein.
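
One way of comparing reporter constructs on this basis, offered only as an illustrative sketch, is to compute the ratio of reporter signal in the presence of the fusion proteins to that in their absence for each construct. The function names and the use of this ratio as the selection criterion are assumptions made for the example; the practitioner may instead optimize the absolute level of expression or another measure.

```python
def fold_induction(with_fusions: float, without_fusions: float) -> float:
    """Ratio of reporter signal with the fusion proteins to signal without them."""
    if without_fusions <= 0:
        raise ValueError("signal without fusion proteins must be positive")
    return with_fusions / without_fusions


def pick_binding_site_count(measurements):
    """Return the number of DBD binding sites giving the highest fold induction.

    `measurements` maps a binding-site count to a tuple of
    (signal without fusion proteins, signal with fusion proteins).
    """
    return max(
        measurements,
        key=lambda n: fold_induction(measurements[n][1], measurements[n][0]),
    )
```

A high signal in the absence of the fusion proteins would suggest that the DNA element is occupied by endogenous factors, which lowers the ratio even where the induced signal is strong.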

By way of illustration, one particular embodiment of the invention uses a reporter gene
construct containing DNA encoding SEAP linked to a DNA element containing iterated GAL4 binding
sites. Additional guidance in the use of suitable promoters for this reporter gene may be obtained
from Fields et al., US Patent No. 5,283,173 (1 Feb 1994) and Vasavada et al., Proc. Natl. Acad. Sci.
USA, 88:10686-10690 (1991).

C. Fusion Protein Constructs

The two fusion proteins of this invention typically contain either (a) one or more DNA
binding domains ("DBDs", which may be the same or different), or (b) one or more transcription
activation domains ("TADs", which may be the same or different), in addition to one or more
phosphopeptide binding domains or peptide ligands therefor. For instance, in one embodiment, the
first fusion protein has an SH2 domain or a tandem SH2 region (two SH2 domains) and a
transcription activation domain, while the second fusion protein has a DNA binding domain and an
SH2 ligand domain, capable after tyrosine phosphorylation, of binding with the SH2 domain(s):

    1st fusion protein: SH2 domain-TAD

    2d fusion protein: DBD-SH2 ligand

In preferred embodiments, the second fusion protein contains a hormone ligand binding domain and
a protein tyrosine kinase domain linking the DBD and SH2 ligand domain, as discussed in further
detail below.

The DBD and the TAD of the two fusion proteins typically originate from a transcription
factor. The DBDs and TADs may be derived from the same or different transcription factors and may
be optimized by genetic engineering. DBDs and TADs are often derived from transcription factors
having separable DNA-binding and transcription activation domains.

i. Transcription Factors 

A large number of transcription factors are known which require two subunits for activity, 
that is, a single transcription factor can be divided into two separate functional domains (e.g. a TAD
and a DBD). While each domain is inactive (transcriptionally) by itself, the paired components,
when in close proximity, comprise an active transcription factor. 
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Transcription factors which can be used in this invention include y ast GAL4. which can be 
divided into two domains [Ma and Ptashne. Cell. 48: 647-853 (1987) and US Patent No 
5.283.173]. One may use. for instance, a fusion of GAL4{1-147)-SNF1 and 
SNF4-GAL4(768-881), where the SNF1 and GAL 4 may be replaced by the subject binding 
proteins as binding domains. Combinations of GAL4 and VP16 or HNF-1 and VP16 can be employed. 
Each of these proteins will have a linked partner, part of an interacting pair. The interaction 
(binding partner recognition) will bring the TAD (VP16) to DNA (GAL4 or HNF-1) and 
transcription will be induced. Other transcription factors that may be used for the method of this 
invention include, but are not restricted to. the members of the Jun. Fos. and ATF/CREB families. 
) Octl. Sp1. HNF-3, the steroid receptor superfamily. and the like. Other TADs and DBDs which may 
be used in the practice of this invention are well known in the art 

The transcription factor can be endogenous or exogenous with respect to the desired host cells. 
If the transcription factors are exogenous, but functional within the host and can cooperate with the 
endogenous RNA polymerase (rather than requiring an exogenous RNA polymerase, for which a gene 
could be introduced), then an exogenous promoter element functional with the DBDs or TADs of the 
transcription factor can be provided with another construct for regulating transcription of the 
reporter gene. Thus, the initiation of transcription can be restricted to the gene(s) associated with 
the exogenous promoter region, i.e., the reporter gene(s).

ii. DNA Binding Domains

DNA-binding domains for use in this invention may be selected from a wide variety of
DNA-binding domains known in the art. Examples include DNA-binding domains of the homeodomain
class, the zinc-finger class, and the paired-box class, for which numerous examples are known in 
the literature. In cases where detailed information on the molecular contacts of the protein with 
DNA is available, the DNA recognition specificity of the protein may be engineered by amino acid 
substitutions. 

A preferred feature for DNA binding domains is the lack of interactions with other cellular 
proteins. Alternatively, the nature of such interactions should be known with precision, so that 
these interactions can be abrogated by suitable amino acid substitutions within the DNA-binding 
domain. In addition, the DNA binding domain will preferably be unique for the cell type utilized. The 
DNA binding element, when introduced into cells, will preferably not be occupied (at least not 
unduly) in the absence of exogenously added DNA binding domain. The level of background occupancy
may be determined through appropriate controls as discussed previously. 

Cell-type specific factors (not found in the cells used for the in vivo assay), DNA binding
domains with altered binding specificity and procaryotic DNA binding domains are well suited for
the methods described. One suitable DNA-binding domain is derived from the Phox1 protein
(Grueneberg, D. A., Natesan, S., Alexandre, C. and Gilman, M. Z. (1992). Human and Drosophila
homeodomain proteins that enhance the DNA-binding activity of serum response factor. Science 257,
1089-1095; Genbank accession number: M95929). This protein is a member of the homeodomain
class. A 69 amino-acid domain derived from Phox1 is sufficient to bind DNA. High-affinity DNA
recognition sequences for the protein have also been identified, as have amino acid substitutions
that change its DNA recognition specificity and that affect its ability to interact with certain
endogenous proteins in human cells (Grueneberg et al.).

A second suitable DNA-binding domain is derived from the SRE-ZBP protein (Attar, R. M. and
Gilman, M. Z. (1992). Expression cloning of a novel zinc-finger protein that binds to the c-fos
serum response element. Mol. Cell. Biol. 12, 2432-2443; Genbank accession number: M88579).
SRE-ZBP is a member of the C2H2 class of zinc-finger proteins. It has seven tandem zinc fingers.
Any one of these zinc-finger domains or any combination of two or more domains can be used to
generate a DNA-binding domain with novel recognition specificity. Furthermore, because the
general structure and mode of DNA recognition is known for proteins of this class, DNA recognition
can be directly modified if necessary.

A third suitable DNA-binding domain is the GAL4 binding domain referred to previously.

A fourth suitable type of DNA-binding domain is a composite DNA binding domain as described
in published International Patent Application No. WO 96/20951 and exemplified by ZFHD1. See
also Pomeranz et al. 1995, Science 267:93-96.

iii. Transcription Activation Domains 

Preferred transcription activation domains are compact in size, potent in activity, and 
nontoxic to cells. Additional features become relevant for particular applications. For example, for
constructs to be used in one particular type of cell or tissue, activator domains with especially high
activity in the host cell are used. In applications that involve the stable integration of the target
gene into host cell DNA, activator domains that resist chromatin suppression are used.

Numerous activation domains are known in the art. Two illustrative activation domains which
may be used in the practice of this invention are the VP16 activation domain and the NF-κB p65
activation domain. See e.g. International Patent Application Nos. WO 96/20951 and WO 96/41865.

iv. Phosphopeptide Binding Pairs 

Each of the fusion proteins contains either at least one TAD or at least one DBD linked to at
least one member of a PBP, i.e. to at least one phosphopeptide binding domain or peptide ligand
therefor. One exemplary PBP is an SH2 domain and a peptide ligand therefor (e.g., a ZAP SH2
domain and an immunoreceptor activation motif such as the TCR zeta ITAM). As noted previously,
the ligand domain will contain one or more Tyr residues, which must be phosphorylated to bind to
the SH2 domain. By way of further example, the peptide ligand domain may be an IgE ITAM, the
polyoma middle T phosphorylation sequence or a synthetic consensus tyrosine phosphorylation
binding site.

In some embodiments, protein domains that are not generally regarded as
influencing, either allosterically or otherwise, the behavior of the SH2 or PID or other such
domain may be included to stabilize the folding, enhance expression or provide a means of
identifying or manipulating the protein (e.g. fusion to glutathione-S-transferase or other domains
that aid in identification/purification, epitope tags, etc.).

As illustrated below, the fusion proteins may contain a plurality of PBDs or ligands
therefor. For example, one embodiment of the present invention uses a fusion protein containing a
tandem SH2 domain from ZAP-70 (containing two SH2 domains).

Peptide ligand members of the PBP for use in the fusion proteins are defined above. For
example, where one member of a binding pair is an SH2 domain, the other member, i.e.,
the peptide ligand, may be a naturally occurring ITAM containing at least one Tyr. Peptide ligands
in their nonphosphorylated state will not bind the PBDs. These ligands must be phosphorylated by
endogenous or (introduced) heterologous protein tyrosine kinase activity within the cell.

A large number of SH2 domains, PIDs, and other phosphopeptide binding domains, as well as
ligands therefor, are known in the art and may be adapted for use in this invention. The following
additional background information and guidance may be of interest and use to the practitioner.

Identification of SH2 or SH2-like Domains 

The term "SH2-like domain or a subdomain thereof" refers to a sequence which is
substantially homologous to a Src homology region 2 (SH2 region), or a subdomain of an SH region,
preferably a conserved region of an SH region. The Src homology region is a noncatalytic domain of
~100 amino acids which was originally identified in the viral Fps and viral Src cytoplasmic
tyrosine kinases by virtue of its effects on both catalytic activity and substrate phosphorylation (T.
Pawson, Oncogene 3, 491 (1988) and I. Sadowski et al., Mol. Cell. Biol. 6, 4396 (1986)). SH2
domains have been found in a variety of eukaryotic proteins, some of which function in
intracellular signal transduction. Many are known in the art. Examples (including counterparts
from various species) of SH2 domain-containing proteins include (1) members of the src-family
protein tyrosine kinases (Src, Lyn, Fyn, Lck, Hck, Fgr, Yes), (2) Shc, (3) Tsk, (4) Btk, (5) VAV,
(6) Grb2, (7) Crk, and (8) signal transducer and activator of transcription (STAT) proteins. In
addition, a number of proteins, such as ZAP-70, p85 phosphatidylinositol 3' kinase (PI3K), Syk, GTPase
Activating Protein (GAP), and Phospholipase C gamma, have two SH2 domains. SH2
domain-containing proteins have been identified in human, rodent, sheep, bovine, C. elegans, Drosophila,
Xenopus, flatworm, freshwater sponge, and hydra.

One way to identify new SH2 or SH2-like domains from unknown DNA, RNA or protein
sequence is by using one of many available computer alignment programs. One example is pfscan,
which can be run via the World Wide Web (WWW) site at
http://ulrec3.unil.ch/software/profilescan.html. To use the program, a protein sequence is tested
against a "profile" describing the SH2 domain motif. According to the program information, the
particular strength of profiles is that they can be used to describe very divergent protein motifs.
These profiles are normally derived from multiple alignments of the initial sequence set. In
addition to the sequences themselves, a profile identifies which types of residues are allowed at
what position within the domain, which amino acids are conserved, which ones are not, which
positions or regions can allow insertions, and which regions may be dispensable. Additional
information on pfscan and PROSITE can be obtained at the web page
http://ulrec3.unil.ch/index.html operated by the Bioinformatics Group at the ISREC (Swiss
Institute for Experimental Cancer Research).

As an example we analyzed the peptide sequence of human Src with the pfscan program. The
results are shown below. The program clearly identified the SH2 domain of Src as encompassing the
region from amino acids 150-247 of the Src peptide sequence. In addition, the SH3 and kinase
domains were identified by pfscan.

NScore    raw    from-to         Profile | Description

26.9695   1792   pos. 150-247    PS50001 | SH2 Src homology 2 (SH2) domain
20.2947   1182   pos. 83-144     PS50002 | SH3 Src homology 3 (SH3) domain
43.4246   2912   pos. 269-522    PS50011 | PROTEIN_KINASE_DOM Protein kinase

The NScore of a match is the negative decadic logarithm of the expected number of matches of the
given quality (or better) in a random database of the given size. For NScores >> 1 this converges to
the probability of finding the match in the database. Since the number of expected matches depends
on the size of the database, the decadic logarithm of the database size must be subtracted before the
calculation:

-log(NExp) = NScore - log(DBsize)

where NExp is the expected number of chance matches and DBsize is the size of the database in
characters.
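The relation above can be rearranged to give the expected number of chance matches directly. The following is a minimal Python sketch of that arithmetic only; the function name and the example database size are illustrative assumptions and are not part of pfscan.

```python
import math


def expected_chance_matches(nscore: float, db_size: int) -> float:
    """Solve -log10(NExp) = NScore - log10(DBsize) for NExp."""
    if db_size <= 0:
        raise ValueError("db_size must be a positive number of residues")
    return 10.0 ** (math.log10(db_size) - nscore)


if __name__ == "__main__":
    # Hypothetical database size, chosen only to illustrate the conversion.
    example_db_size = 50_000_000
    for nscore in (7.0, 8.0, 9.0, 10.0):
        n_exp = expected_chance_matches(nscore, example_db_size)
        print(f"NScore {nscore:4.1f}: {n_exp:.4g} expected chance matches")
```

Each unit increase in NScore lowers the expected number of chance matches tenfold, and a larger database raises it in proportion to its size.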

The following table gives some examples of how to convert the NScores into probabilities for
the SwissProt database and the nonredundant (nr) protein database. The calculation is based on a
database size of

16,531,385 residues for SwissProt (log ≈ 7.27)
58,154,119 residues for the nr database (log ≈ 7.76)

NScore    Expected chance matches in:
          SwissProt    nonredundant

 7.0      1.8          5.8
 7.5      0.58         1.82
 8.0      0.18         0.58
 8.5      0.058        0.182
 9.0      0.018        0.058
 9.5      0.006        0.0182
10.0      0.0018       0.0058
10.5      0.0006       0.0018

...and so on...

A segment of a test sequence is considered to contain an SH2 domain if its SH2 profile NScore
value is > 7.5, preferably > 8, more preferably > 9, more preferably > 10.

As a second example, the N-terminal 160 amino acid sequence from human ZAP-70 was
applied to pfscan. The result indicated an SH2 domain bounded by amino acids 10-102.

NScore    raw    from-to         Profile | Description

16.4402   1082   pos. 10-102     PS50001 | SH2 Src homology 2 (SH2) domain

The minimal segment encoding an SH2 or SH2-like domain as determined by sequence
alignment may not be enough to function or function optimally in the assay. Additional sequences
either N-terminal and/or C-terminal of this minimal segment may be necessary or desirable for
optimizing the binding of the encoded protein to a peptide ligand within the context of the invention.
These additional sequences may be derived from the natural protein sequence or may be derived
from other proteins or even non-natural sequences that are added to the expression vector.
However, the sequence required can be readily determined biochemically, in an in vitro binding
assay, or genetically, using the production of SEAP as an indicator of SH2 domain-phosphopeptide
interaction.

SH2 domains can be identified using other computer alignment programs, such as MegAlign
within the DNAstar computer package (Madison, WI). To do this, one or more known SH2 domains
and a test sequence are aligned by the Clustal method. A sequence having ≥ 25%, in some cases
30-50%, in other cases > 50%, amino acids identical to a known SH2 domain is identified as an SH2
homology domain. The positions of identical amino acids between the test sequence and different
known SH2 domains can vary, except for one position. All SH2 domains identified to date have a
conserved arginine residue approximately 25-40 residues from the start of the SH2 homology
domain. In human src this arginine is found within the sequence FLVRES, where abbreviations for
the amino acid residues are: F, Phe; L, Leu; V, Val; R, Arg; E, Glu; S, Ser.
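The two checks described above (percent identity against a known SH2 domain, and an arginine roughly 25-40 residues into the domain) can be sketched in Python as follows. This is an illustrative sketch, not the MegAlign or Clustal implementation: it assumes the two sequences have already been aligned to equal length with "-" as the gap character, it counts identity over ungapped columns only (one of several possible conventions), and all function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of identical residues over the ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == "-" or res_b == "-":
            continue  # skip columns containing a gap in either sequence
        compared += 1
        if res_a == res_b:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


def has_conserved_arginine(domain: str, first: int = 25, last: int = 40) -> bool:
    """True if an Arg (R) occurs between residues first..last (1-based) of the domain."""
    window = domain.upper()[first - 1:last]
    return "R" in window


def looks_like_sh2(aligned_test: str, aligned_known: str, min_identity: float = 25.0) -> bool:
    """Apply the >= 25% identity criterion and the arginine check to the test sequence."""
    ungapped_test = aligned_test.replace("-", "")
    return (percent_identity(aligned_test, aligned_known) >= min_identity
            and has_conserved_arginine(ungapped_test))
```

The 25% default mirrors the lowest threshold given in the text; a stricter cutoff can be passed as `min_identity`.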

Another way to identify SH2 or SH2-like domains is by running a query in the federated
nucleotide or protein databases for the SH2 domain feature. In the SWISS-PROT database, this is
listed under the FT or "feature" heading. The SWISS-PROT database can be accessed over the WWW at
EBI http://www.ebi.ac.uk. For example, in the file listed for human Src (P12931), the region
containing the SH2 domain is shown to be 150-247.

SWISS-PROT: P12931

ID   SRC_HUMAN      STANDARD;      PRT;   535 AA.
AC   P12931;
DR   MIM; 190090;
DR   PROSITE; PS00107; PROTEIN_KINASE_ATP.
DR   PROSITE; PS00109; PROTEIN_KINASE_TYR.
DR   PROSITE; PS50001; SH2.
DR   PROSITE; PS50002; SH3.
DR   PROSITE; PS50011; PROTEIN_KINASE_DOM.
DR   PRODOM [Domain structure / List of seq. sharing at least 1 domain]
DR   SWISS-2DPAGE; GET REGION ON 2D PAGE.
KW   TRANSFERASE; TYROSINE-PROTEIN KINASE; PROTO-ONCOGENE; PHOSPHORYLATION;
KW   ATP-BINDING; MYRISTYLATION; SH3 DOMAIN; SH2 DOMAIN.
FT   INIT_MET      0      0       BY SIMILARITY.
FT   LIPID         1      1       MYRISTATE (BY SIMILARITY).
FT   DOMAIN       83    144       SH3.
FT   DOMAIN      150    247       SH2.
FT   DOMAIN      269    522       PROTEIN KINASE.
FT   NP_BIND     275    283       ATP (BY SIMILARITY).
FT   BINDING     297    297       ATP (BY SIMILARITY).
FT   ACT_SITE    388    388       BY SIMILARITY.
FT   MOD_RES     419    419       PHOSPHORYLATION (AUTO) (BY SIMILARITY)
FT   MOD_RES     529    529       PHOSPHORYLATION (BY SIMILARITY).
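
Reading domain boundaries out of the FT ("feature") lines of such a record can be automated. The Python sketch below assumes the whitespace-separated layout of the excerpt above (line code, feature key, start, end, description); the function names are hypothetical, and the sample text is a shortened illustration rather than a complete database entry.

```python
from typing import List, Optional, Tuple

Feature = Tuple[str, int, int, str]  # (key, start, end, description)


def parse_feature_lines(record: str) -> List[Feature]:
    """Collect (key, start, end, description) from lines beginning with 'FT'."""
    features: List[Feature] = []
    for line in record.splitlines():
        if not line.startswith("FT"):
            continue
        fields = line.split(None, 4)
        # Expect: FT, key, start, end, and an optional free-text description.
        if len(fields) < 4 or not (fields[2].isdigit() and fields[3].isdigit()):
            continue
        description = fields[4].strip() if len(fields) > 4 else ""
        features.append((fields[1], int(fields[2]), int(fields[3]), description))
    return features


def domain_range(features: List[Feature], name: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the first DOMAIN feature whose description equals name."""
    for key, start, end, description in features:
        if key == "DOMAIN" and description.rstrip(".").strip() == name:
            return (start, end)
    return None


SAMPLE_RECORD = """\
ID   SRC_HUMAN      STANDARD;      PRT;   535 AA.
FT   DOMAIN       83    144       SH3.
FT   DOMAIN      150    247       SH2.
FT   NP_BIND     275    283       ATP (BY SIMILARITY).
"""

if __name__ == "__main__":
    print(domain_range(parse_feature_lines(SAMPLE_RECORD), "SH2"))
```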



Yet another way to identify SH2 or SH2-like domains may be accomplished by screening a
cDNA expression library with a phosphorylated peptide ligand for a known SH2 domain to isolate
cDNAs for SH2 proteins. One could use PCR or low stringency screening with an SH2-specific
probe. The SH2 domain or protein containing the SH2 domain may be isolated from naturally
occurring sources (e.g. cells, tissues, organs, etc.); produced recombinantly in bacteria, yeast or
eukaryotic cells; produced in vitro using cell-free translation systems; or produced synthetically
(e.g. peptide synthesis).

Certain SH2 or SH2-like domains may neither be identified via the pfscan program nor exhibit
sufficient homology with known SH2 domain sequences to be detected by computer alignment
programs. These sequences may, nevertheless, exhibit the same or similar three-dimensional
structure as known SH2 domains and function as an SH2-like domain to bind
phosphotyrosine-containing peptides or proteins. The three-dimensional structures of several
known SH2 domains have been determined. SH2 domains are characterized as two anti-parallel beta
sheets composed of 5 or 6 beta strands. Regions forming an alpha helix may or may not be present
within the domain. SH2 or SH2-like domains may be recognized as having an SH2-like domain
structure when solved by x-ray crystallography or NMR spectroscopy. Alternatively, a predicted
structure by homology modeling may be used to identify a particular protein sequence as an
SH2-like domain.

The alignment of SH2 domains used to generate the SH2 profile for pfscan, as taken from
http://ulrec3.unil.ch/prf_details/alignments/SH2.msf (profile matrix can be obtained from
http://ulrec3.unil.ch/cgi-bin/get_pstprf?SH2), is based on alignment of approximately 390 SH2
domains from proteins of various species. The list of proteins containing SH2 domains used in the
alignments in the Swiss-Prot Database includes the following (P##### is the Swiss-Prot
Database Accession number):

P00519, ABL1_HUMAN    P00520, ABL_MOUSE     P00521, ABL_MLVAB
P00522, ABL_DROME     P00523, SRC_CHICK     P00524, SRC_RSVSR
P00525, SRC_AVISR     P00526, SRC_RSVP      P00527, YES_AVISY
P00528, SRC1_DROME    P00530, FPS_FUJSV     P00541, FPS_AVISP
P00542, FES_FSVGA     P00543, FES_FSVST     P00544, FGR_FSVGR
P03949, ABL1_CAEEL    P05433, GAGC_AVISC    P05480, SRCN_MOUSE
P06239, LCK_HUMAN     P06240, LCK_MOUSE     P06241, FYN_HUMAN
P07332, FES_HUMAN     P07947, YES_HUMAN     P07948, LYN_HUMAN
P08103, HCK_MOUSE     P08487, PIP4_BOVIN    P08630, SRC2_DROME
P08631, HCK_HUMAN     P09324, YES_CHICK     P09769, FGR_HUMAN
P09851, GTPA_BOVIN    P10447, ABL_FSVHY     P10686, PIP4_RAT
P10936, YES_XENLA     P12931, SRC_HUMAN     P13115, SRC1_XENLA
P13116, SRC2_XENLA    P13406, FYN_XENLA     P14084, SRC_AVISS
P14085, SRC_AVIST     P14234, FGR_MOUSE     P14238, FES_FELCA
P15054, SRC_AVIS2     P15498, VAV_HUMAN     P16277, BLK_MOUSE
P16333, NCK_HUMAN     P16591, FER_HUMAN     P16879, FES_MOUSE
P16885,               P17713, STK_HYDAT     P18106, FPS_DROME
P19174, PIP4_HUMAN    P20936, GTPA_HUMAN    P23615, SPT6_YEAST
P23726, P85B_BOVIN    P23727, P85A_BOVIN    P24135, PIP5_RAT
P24604, TEC_MOUSE     P25020,               P25911, LYN_MOUSE
P26450, P85A_MOUSE    P27446, FYN_XIPHE     P27447, YES_XIPHE
P27870, VAV_MOUSE     P27986, P85A_HUMAN    P29349, CSW_DROME
P29350, PTN6_HUMAN    P29351, PTN6_MOUSE    P29353, SHC_HUMAN
P29354, GRB2_HUMAN    P29355, SEM5_CAEEL    P31693, SRC_RSVPA
P32577, CSK_RAT       P34265, YXF1_CAEEL    P35235, PTNB_MOUSE
P35991, BTK_MOUSE     P39688, FYN_MOUSE     P40763, STA3_HUMAN
P41239, CSK_CHICK     P41240, CSK_HUMAN     P41241, CSK_MOUSE
P41242, CTK_MOUSE     P41243, CTK_RAT       P41499, PTNB_RAT
P42224, STA1_HUMAN    P42225, STA1_MOUSE    P42226, STA6_HUMAN
P42227, STA3_MOUSE    P42228, STA4_MOUSE    P42229, STA5_HUMAN
P42230,               P42231, STA5_SHEEP    P42232, STAB_MOUSE
P42679, CTK_HUMAN     P42680, TEC_HUMAN     P42681, TXK_HUMAN
P42682, TXK_MOUSE     P42683, LCK_CHICK     P42684, ABL2_HUMAN
P42685,               P42686, SRK1_SPOLA    P42687, SPK1_DUGTI
P42688, SRK2_SPOLA    P42689, SRK3_SPOLA    P42690, SRK4_SPOLA
P43403, ZA70_HUMAN    P43404, ZA70_MOUSE    P43405, SYK_HUMAN
P46108, CRK_HUMAN     P46109, CRKL_HUMAN    Q00655, SYK_PIG
Q02977, YRK_CHICK     Q03526, ITK_MOUSE     Q04205, TENS_CHICK
Q04736, YES_MOUSE     Q04929, CRK_CHICK     Q05876, FYN_CHICK
Q06124, PTNB_HUMAN    Q06187, BTK_HUMAN     Q07014, LYN_RAT
Q07883, GRB2_CHICK    Q08012, DRK_DROME     Q08881, ITK_HUMAN

A general method to identify an SH2 domain within a test peptide or nucleotide sequence follows:

1. Translate the cDNA or RNA into single letter code protein sequence. This could be accomplished
using a computer program such as DNA Strider or EditSeq in the DNAstar package.

2. Go to the WWW site at http://ulrec3.unil.ch/software/profilescan.html

3. Copy the test sequence into the appropriate box in the pfscan form

4. Submit the form to the pfscan server

5. The results are sent back through the web browser or via e-mail.

The subject invention is relevant to SH2 and SH2-like domains as described in the foregoing
paragraphs. Using information provided herein, and by analogy to the examples provided below, one
may carry out this invention with any SH2 domain, SH2-like domain, PID or PID-like domain and
a peptide ligand therefor, e.g. in place of ZAP, Syk, Src or Fyn SH2 domains.

Identification of PID or PID-like Domains

An alternative phosphotyrosine binding domain to SH2 domains is the so-called
phosphotyrosine interaction domain (PID). This domain, containing on average about 160 amino
acid residues, was originally identified in the Shc protein. In contrast to SH2 domains, which
recognize sequences having a consensus pTyr-Xaa-Xaa-Xaa-Xaa (a phosphotyrosine followed by
three or more amino acids), PID domains recognize sequences with the consensus
Asn-Pro-Xaa-pTyr (also called NPXY in single letter code). The invention described in this
application is also relevant to PID and PID-like domains. In this case, the coding sequence for a PID
domain is substituted in the appropriate vector for the SH2 domain coding sequence and a ligand that
recognizes the PID domain replaces the SH2 domain ligand. Phosphorylation of the PID ligand could
be accomplished using v-Src, as described herein. Alternative protein kinases could be used to
phosphorylate the PID ligand. In addition, a protein kinase endogenous within the cell could catalyze
phosphorylation of the PID ligand.
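
Candidate PID ligand sites can be located in a protein sequence by searching for the NPXY consensus (Asn-Pro-Xaa-Tyr) in single letter code. The Python sketch below is illustrative only: the function name and the toy sequence are assumptions, and a sequence match says nothing about whether the tyrosine is actually phosphorylated in a cell.

```python
import re
from typing import List, Tuple

# Lookahead so that overlapping occurrences of the motif are all reported.
_NPXY = re.compile(r"(?=(NP[A-Z]Y))")


def find_npxy_sites(protein: str) -> List[Tuple[int, int, str]]:
    """Return (motif_start, tyr_position, motif) using 1-based residue numbering."""
    sites = []
    for match in _NPXY.finditer(protein.upper()):
        start = match.start() + 1       # position of the Asn
        tyr_position = start + 3        # the Tyr is the fourth residue of the motif
        sites.append((start, tyr_position, match.group(1)))
    return sites


if __name__ == "__main__":
    toy_sequence = "AAANPQYGGSSNPEYTT"  # hypothetical sequence for illustration
    for start, tyr, motif in find_npxy_sites(toy_sequence):
        print(f"{motif} at residues {start}-{tyr}; candidate phosphotyrosine at {tyr}")
```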

Significant information concerning these domains is known in the art. A detailed description
of the PID domains can also be found on the WWW at the site
http://www.bork.embl-heidelberg.de/Modules/pid-grf.html. The following information is taken from that site:

Documentation - PROSITE description

Beside SH2, the phosphotyrosine interaction domain (PI domain or PID) [3] is the second
phosphotyrosine-binding domain found in the transforming protein Shc [1,2]. Shc couples
activated growth factor receptors to a signalling pathway that regulates the proliferation of
mammalian cells and it might participate in the transforming activity of oncogenic tyrosine
kinases. The PI domain specifically binds to the Asn-Pro-Xaa-Tyr(p) motif found in many
tyrosine-phosphorylated proteins including growth factor receptors. PID has also been found in
the Shc related protein Sck [1] and several otherwise unrelated regulatory proteins [3] which
are listed below.

• Mammalian Shc (46 kD and 52 kD isoforms) contains one N-terminal PID, a collagen-like
  domain and a C-terminal SH2 domain.
• Human Shc related protein Sck contains one PI domain and a SH2 domain.
• Mammalian X11 is expressed prominently in the nervous system. It contains 2 disc
  homologous regions (DHR) of about 100 AA downstream of the PID.
• Drosophila nuclear Numb protein is required in determination of cell fate during sensory
  organ formation in Drosophila embryos. It has one PID.
• Caenorhabditis hypothetical protein F56D2.1 contains an N-terminal metalloproteinase
  domain followed by one PID.
• Rat FE65. The WW domain as well as the 2 PIDs found in the sequence of FE65 indicate that
  this protein is probably involved in signal transduction.
• Drosophila protein disabled is a cytoplasmic, tyrosine phosphorylated protein found in
  CNS axons and body wall muscles. It is involved in embryonic neural development. It
  contains one N-terminal PI domain.
• Mouse mitogen responsive phosphoprotein isoforms P96, P93 and P67, which are produced
  by alternative splicing, contain one N-terminal PID. This is also true for the differentially
  expressed human ortholog Doc-2.
• Human EST05045 protein fragment has one PID.

References:

[1] Kavanaugh W.M., Williams L.T. Science 266:1862-1865 (1994)
[2] Blaikie, P. et al., J. Biol. Chem. 269, 32031-32034 (1994)
[3] Bork P., Margolis B. Cell 80, 693 (1995)

A PI domain alignment based on approximately 40 PI domains from various species is illustrated in
the WWW site at http://ulrec3.unil.ch/prf_details/alignments/PID.msf. An alignment based on
approximately 50 PID sequences is shown at the web site at
http://www.bork.embl-heidelberg.de/Modules/pi-ali.html.

The minimal segment encoding a PID or PID-like domain as determined by sequence alignment
may not be enough to function or function optimally in the assay. Additional sequences either
N-terminal and/or C-terminal of this minimal segment may be necessary or desirable for optimizing
the binding of the encoded protein to a peptide ligand within the context of the invention. These
additional sequences may be derived from the natural protein sequence or may be derived from
other proteins or even non-natural sequences that are added to the expression vector. However, the
sequence required can be determined biochemically, in an in vitro binding assay, or genetically,
using the production of SEAP as an indicator of PID domain-phosphopeptide interaction.

v. Optional Domains 

One or both of the fusion proteins may also contain one or more optional domains, including a
protein kinase domain, as mentioned above, or a domain capable of regulatably rendering the fusion
protein unavailable for two-hybrid formation (e.g. by sequestering the fusion protein at a desired
cellular location or compartment or by maintaining the fusion protein in an inactive
conformation).

Kinase Domains. As one example, if the cell does not have an endogenous protein kinase
capable of phosphorylating the tyrosine residues of the peptide ligand for the SH2, PID or other
phosphopeptide binding domain, a heterologous kinase activity is introduced into the cell. A DNA
encoding such a heterologous kinase may be introduced into the cell as an independent transcription
unit (i.e., with its own transcription regulatory elements, such as a promoter and/or enhancer
sequence). Alternatively, a kinase activity may be introduced as a functional kinase domain within
one of the two fusion proteins of this invention. In one embodiment the fusion proteins comprise a
first chimera containing one or more SH2 domains linked to one or more transcription activation
domains and a second chimera containing a DBD-kinase domain-peptide ligand domain fusion.

In one embodiment, the protein kinase domain from the v-Src protein was used to
phosphorylate the peptide ligand linked to the DNA binding domain. Alternative protein kinases may
be used to conduct this phosphorylation. The recognition of protein kinase domains can be conducted
by the same approaches described for SH2 or PID domains using pfscan, ClustalV analysis,
hybridization, or functional biochemical analysis. A segment defining the protein kinase homology
region is approximately 280 amino acids. A detailed alignment of protein kinase domains can be
obtained at the pfscan web site listed above or at the protein kinase web site at University of
California at San Diego (http://www.sdsc.edu/kinases/). The minimal segment encoding a protein
kinase domain as determined by sequence alignment is bounded by a conserved glycine in the
N-terminal portion and a conserved arginine at the C-terminal portion. However, additional sequences
either N-terminal and/or C-terminal of this minimal segment may be necessary or desirable for
the encoded protein to functionally phosphorylate the peptide ligand within the context of the
invention. These additional sequences may be derived from the natural protein kinase sequence or
may be derived from other proteins or even non-natural sequences that are added to the expression
vector. The sequence required can be readily determined biochemically, in an in vitro
phosphorylation reaction, or genetically, using the production of SEAP as an indicator of SH2 (or
PID) domain-phosphopeptide interaction. For domains that recognize phosphothreonine or
phosphoserine-modified peptides, an appropriate protein kinase would be substituted for the v-Src
kinase domain.

Localization domains. As another example, fusion proteins of this invention may also
contain a targeting sequence providing for translocation of the protein to the nucleus. Such a
targeting sequence may have a plurality of basic amino acids, referred to as a bipartite basic repeat
[see, e.g., Garcia-Bustos et al., Biochim. Biophys. Acta 1071:83-101 (1991)]. This sequence can
appear in any portion of the molecule internal or proximal to the N- or C-terminus and results in
localization of the fusion protein within the nucleus.

Nuclear Hormone Binding Domains. Compounds introduced into cells may be labile and
therefore not present for long periods of time once introduced into the cell. In such cases, it would
be advantageous to have all components present in the cell at the time the cells are exposed to test
compositions. We therefore have made constructs encoding fusion proteins containing a nuclear
hormone ligand binding domain (HBD). Numerous such hormone receptors and ligands are known
which may be used in the practice of this invention, including estrogen, glucocorticoid, retinoic
acid, aldosterone and vitamin D receptors. The regulatory properties of a nuclear hormone ligand
binding domain (Picard et al. Cell (1988) 54, 1073-1080; Eilers et al. Nature (1989) 340,
66-68) allow the post-translational regulation of interaction of the fusion proteins. By way of
illustration, an estrogen receptor domain serves as an autonomous regulatory domain: upon fusion
to a heterologous protein, the steroid receptor domain subjects the chimeric protein to hormonal
control. This regulatory property is due to the hormone-reversible interactions of the HBD with
heat shock proteins. For example, a PBD-TAD fusion protein and a DBD-HBD-kinase-PBD ligand
fusion protein can be expressed in cells containing the reporter gene construct. Only with the
addition of steroid hormone will the fusion proteins be able to interact. In the absence of hormone,
the HBD-containing fusion protein will not be competent to interact with its complementary fusion
protein to form the two-hybrid complex.

Other domains that may optionally be included within the fusion proteins include an antibody
recognition sequence (epitope tag) permitting detection using antibodies. Such monitoring can be
useful to confirm that the fusion protein is made and that the level of expression is not affected by
compounds tested for inhibition of phosphopeptide domain binding.

III. Assembly of Fusion Protein Constructs, Reporter Gene Plasmids and
Genetically Engineered Cells of the Invention

Constructs encoding the fusion proteins and containing the reporter genes of this invention
can be introduced into the cells as one or more DNA molecules or constructs, in many cases in
association with one or more markers to allow for selection of host cells which contain the
construct(s). The constructs can be prepared in conventional ways, where the coding sequences and
regulatory regions may be isolated, as appropriate, ligated, cloned in an appropriate cloning host,
and analyzed by restriction or sequencing, or other convenient means. In particular, using PCR,
individual fragments including all or portions of a functional unit may be isolated, where one or
more mutations may be introduced using "primer repair", ligation, in vitro mutagenesis, etc. as
appropriate. For example, a DNA sequence encoding the DNA-binding domain or TAD(s) is joined to
DNA encoding the appropriate member of the PBP. These sequences are joined such that they
constitute a single open reading frame that can be translated in cells into a single polypeptide
harboring all intended domains. The order and arrangement of the domains within the polypeptide
can vary. The construct(s), once completed and demonstrated to have the appropriate sequences, may
then be introduced into a host cell by any convenient or desired means. The constructs may be
incorporated into vectors capable of episomal replication (e.g. BPV or EBV vectors) or into vectors
designed for integration into the host cells' chromosomes. The constructs may be integrated and
packaged into non-replicating, defective viral genomes like Adenovirus, Adeno-associated virus
(AAV), or Herpes simplex virus (HSV) or others, including retroviral vectors, for infection or
transduction into cells. Alternatively, the construct may be introduced by protoplast fusion,
electroporation, biolistics, calcium phosphate transfection, lipofection, microinjection of DNA or
the like. The host cells will in some cases be grown and expanded in culture before introduction of
the construct(s), followed by the appropriate treatment for introduction of the construct(s) and
integration of the construct(s). It may be most convenient to use simple transfection procedures.
The cells will then be expanded and screened by virtue of a marker present in the constructs.
Various markers which may be used successfully include hprt, neomycin resistance, thymidine
kinase, hygromycin resistance, etc., and various cell-surface markers such as Tac, CD8, CD3,
Thy1 and the NGF receptor.

In some instances, one may have a target site for homologous recombination, where it is
desired that a construct be integrated at a particular locus. For example, one can delete and/or
replace an endogenous gene (at the same locus or elsewhere) with a recombinant target construct of
this invention. For homologous recombination, one may generally use either Ω or O-vectors. See,
for example, Thomas and Capecchi, Cell (1987) 51, 503-512; Mansour, et al., Nature (1988)
336, 348-352; and Joyner, et al., Nature (1989) 338, 153-156.

The constructs may be introduced as a single DNA molecule containing all of the genes, or
different DNA molecules having one or more genes. The constructs may be introduced 
simultaneously or consecutively, each with the same or different markers. 

Vectors containing useful elements such as bacterial or yeast origins of replication,
selectable and/or amplifiable markers, promoter/enhancer elements for expression in
procaryotes or eucaryotes, and mammalian expression control elements, etc., which may be used to
prepare stocks of construct DNAs and for carrying out transfections, are well known in the art, and
many are commercially available.

Additional background information and general guidance to the practitioner with respect to 
design, assembly, incorporation into plasmids, and transfection of constructs for such fusion 
proteins and reporter genes is available in the following published international patent 
applications: WO 94/18317, WO 95/02684, WO 96/20951, WO 95/24419 and WO 96/41865,
the contents of which are incorporated herein by reference.

IV. The Assay Method of the Invention 

Genetically engineered cells containing the various DNA constructs described above may be
used to carry out the assay method of this invention. Such assays permit one to identify the
presence in a test composition of an inhibitor of the binding of a PBP with a tyrosine-phosphorylated
ligand therefor. Test compositions or compounds to be assessed for their
inhibitory activity can be obtained from a variety of sources, including, for example, microbial
broths, cellular extracts, conditioned media from cells, synthetic compounds and combinatorial
libraries, and may be tested individually or in pools. The assay method of this invention may be
used to screen natural product and test compound libraries or structurally-biased diversity
libraries to identify desired inhibitors. The test composition may be selected from a mixture of one
or more test peptides, wherein said mixture is provided in the form of a library of synthetic
peptides or in the form of a phage library displaying the various peptides.

A. The General Method

The method involves the use of a cell as described above, particularly a cell in which the
second fusion protein also contains a heterologous kinase domain or the cell contains an endogenous
or heterologous kinase capable of tyrosine-phosphorylating the ligand domain. According to this
method, the above-described engineered cells are cultured or maintained in a conventional culture
medium under suitable conditions permitting growth of the cells both in the presence and absence of
the test composition. For example, the cell is cultured in standard tissue culture media containing
the drugs necessary to select for cells which stably retain the introduced genes described above.
Cells may be cultured in standard tissue culture dishes, e.g. multidishes and microwell plates, or in
other vessels or arrays of vessels, as desired.

The test compounds are added to the culture media to assay their effect on PBP interaction.
Cells at this time may be bathed in tissue culture media (with or without serum) or a balanced salt
solution. The compounds to be tested may be cell permeable and therefore added directly to cells.
Alternatively, it may be necessary to first make the cells permeable using streptolysin O,
tetanolysin or another cell permeabilizing agent.

During incubation and growth of the cells in the absence of an inhibitor test composition, the
cell is capable of expressing the above recited fusion proteins and reporter gene product. Normally
this involves the DNA-binding fusion protein binding to the DNA element of the reporter gene.
Tyrosine residues in the peptide ligand domain are phosphorylated by the kinase activity
(endogenous or heterologous) in the cell. The phosphorylated peptide ligand, e.g., an ITAM, binds its
PBD on the other fusion protein, e.g., an SH2 domain [see, e.g., Pawson, US Patent No. 5,352,660
(4 October 1994)] to generate the two-hybrid complex. The end result is measurable expression
of the detectable reporter gene product.

Thus, an additional step of this method is determining whether production of the detectable
gene product was diminished in the presence of the test composition. After transcription is induced,
cells or media are evaluated to measure reporter gene expression. In some configurations, the assay
is conducted in 96 well plates or other arrays, and the reporter gene expression is measured in
situ. In other cases, cells or media are harvested and an extract prepared (if necessary) for
evaluation. Comparison of the measurements of detectable reporter gene product in the presence
and absence of the test composition permits the identification of the test composition as an inhibitor
of the PBP interaction. Transcription of the detectable gene product induced due to PBP interactions
is assayed using standard colorimetric or fluorescence assays.

For example, if the test compound is not an inhibitor of the formation and/or stability of the
PBP complex, the measurement of detectable reporter gene product in the presence and absence of
the test composition will be about equivalent, since the results of a noninhibitor will be similar to
results obtained with no test compound, as described above. If the test compound is a putative
inhibitor of the PBP interaction, a decrease in the level of reporter gene expression is observed. This
decrease indicates that the test compound inhibited the binding of the PBD to its phosphorylated peptide
ligand, and thus inhibited the formation or persistence of the two-hybrid complex. The reporter gene
expression is thus extinguished or reduced, depending on the amount of test composition present in
the culture and the strength of the binding between the test compound and either member of the
PBP.
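
The comparison of reporter signal in the presence and absence of the test composition can be sketched numerically. The Python fragment below is only an illustration: the function names, the optional background subtraction and the 50% cutoff are assumptions made for the example and are not specified in this description.

```python
def percent_inhibition(signal_with_compound, signal_without_compound, background=0.0):
    """Percent reduction in reporter signal attributable to the test composition.

    Both signals are raw reporter readouts (e.g. colorimetric or fluorescence
    units); 'background' is a hypothetical blank reading subtracted from each.
    """
    untreated = signal_without_compound - background
    if untreated <= 0:
        raise ValueError("untreated signal must exceed background")
    treated = signal_with_compound - background
    return 100.0 * (1.0 - treated / untreated)


def is_putative_inhibitor(signal_with_compound, signal_without_compound,
                          background=0.0, threshold_percent=50.0):
    """Flag a composition whose reporter signal falls by at least the cutoff.

    The 50% default is an arbitrary illustrative cutoff, not a value taken
    from this description.
    """
    inhibition = percent_inhibition(signal_with_compound,
                                    signal_without_compound, background)
    return inhibition >= threshold_percent
```

A noninhibitor gives roughly equal signals and therefore a value near zero, while a compound that disrupts the two-hybrid complex gives a value that rises with its concentration and binding strength.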

For example, in one embodiment of this invention, test compounds are screened in eukaryotic
host cells for blocking/inhibiting the interaction of the ZAP-70 tandem SH2 domain with the
TCR zeta ITAM in vivo (see Example 3). Compounds that block ZAP SH2 binding to the ITAM sequence
are detected using a first fusion protein comprising the vSrc kinase domain, estrogen receptor
domain and the TCR zeta ITAM ligand fused to a DNA-binding domain and a second fusion protein
comprising the tandem SH2 domain of ZAP fused to a TAD. The interaction of the ZAP SH2 domains
with the zeta ITAM domain brings the activator domain in proximity to the DNA element, inducing
reporter gene transcription. Compounds that block the ZAP SH2-zeta ITAM interaction reduce
reporter gene expression.

In this embodiment, and by way of illustration, the assay method of this invention offers
several advantages over traditional T cell assays. The assay readout is directly dependent on the
interaction between the particular SH2 ligand and the SH2 domain-containing polypeptide (e.g., the
ZAP-70 tandem SH2 domain). In contrast, in standard T cell assays that measure cell
proliferation, cytolytic activity or cytokine production, there are many steps between activation
and the assay endpoints. Therefore, many nonspecific inhibitors score in standard T cell assays
which will not score in the assay of this invention.

B. Eliminating Non-Specific Inhibition

To exclude test compounds that inhibit reporter gene expression by nonspecific mechanisms
(e.g., by having a general inhibitory effect on cellular transcription or translation), test
compounds are also assayed for their effect on reporter gene expression when reporter gene
transcription is not dependent on PBP interactions. For this purpose, an indicator cell line is
utilized wherein the transcription activation domain is brought directly to DNA by the DBD (by
expressing in cells a DBD/transcription activation domain fusion). To rule out false positives based
on otherwise irrelevant effects of a test composition relating to the HBD, the fusion protein may
further contain a hormone binding domain (HBD). In such cases, the only difference between the
positive control indicator cell line and the assay cell line is the dependence of the assay cell line
upon PBP complex formation for reporter gene expression.

General inhibitory effects on transcription are also revealed by performing radiolabeled
uridine pulse labelings during exposure to the test compound. Inhibition of general incorporation of
radiolabel indicates a general inhibitory effect on transcription.
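
The logic of pairing the assay cell line with the positive control indicator cell line can be illustrated as a simple decision rule. This Python sketch is a hedged illustration only: the function name, the category labels and both percentage cutoffs are hypothetical choices made for the example.

```python
def classify_hit(assay_inhibition_pct, control_inhibition_pct,
                 hit_threshold=50.0, control_threshold=20.0):
    """Sort a test compound by its effect in the two cell lines.

    assay_inhibition_pct:   percent loss of reporter signal in the assay cell
                            line (expression depends on PBP complex formation).
    control_inhibition_pct: percent loss of reporter signal in the control line
                            expressing a direct DBD/TAD fusion (PBP-independent).
    Both cutoffs are illustrative assumptions, not values from this description.
    """
    if assay_inhibition_pct < hit_threshold:
        return "inactive"
    if control_inhibition_pct >= control_threshold:
        # Reporter expression also drops when no PBP interaction is required,
        # pointing to a general effect on transcription or translation.
        return "nonspecific"
    return "candidate specific inhibitor"
```

A compound that lowers the signal only in the assay cell line is retained for follow-up, whereas one that lowers it in both lines is set aside as nonspecific.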

C. Simultaneous Exposure of the Cell to the Assay Components

In another aspect of this method, the first fusion protein transfected into the cell contains an
HBD as described above. The HBD is an important feature of this aspect of the method of use of this
cell. Test compounds introduced into cells may be labile and therefore not present for long periods
of time once introduced into the cell. It would be advantageous to have all interacting components
present in the cell at the time the cells are exposed to potential inhibitors of the PBP interaction.

Inhibitors of phosphopeptide binding protein-protein interactions could exert their effect in
two ways. They could compete with the ligand/HBD/TAD domain for the PBD or displace the
ligand/HBD/TAD which has complexed with the PBD. Mechanistically, it will be easier to block
than displace the interaction between the binding domain and its ligand. Potential inhibitors, i.e.,
test compounds, can be introduced into cells prior to complex formation (hormone addition) to
block complex formation.

D. Cell-Free Assay 

In another embodiment of this method, cell-free transcription systems prepared from cells
as described above or prepared from the purified components described above may be used in lieu of
the engineered cells. In a cell-free implementation, peptide ligands may be synthetic or
nonsynthetic peptides, peptoids or small molecules identified from libraries, cell broths, natural
extracts, etc. The method of the assay is the same as above, except that it is performed in a cell-free
environment. In this case, the kinase domain should preferably be part of the fusion protein
containing the DBD and the ligand domain for the phosphopeptide binding domain.

V. The Identified Phosphopeptide Binding Pair Inhibitors

A PBP inhibitor identified by the methods of this invention selectively binds to a PBD of
interest, such as an SH2 domain. Such an inhibitor can block or inhibit protein-protein,
protein-peptide, protein-nucleotide, protein-polynucleotide, protein-lipid, protein-carbohydrate
or protein-small molecule interactions mediated by a phosphopeptide binding domain of interest.
Once an agent has been identified as a phosphopeptide binding or blocking/inhibiting agent, it
can be produced using known methods, such as by recombinant methods of protein production or
chemical synthesis. It can also be obtained from the source in which it was initially identified.

A. Counterscreens

Having identified a PBP inhibitor by means of the above assay, rapid and high throughput
counterscreens using cells expressing a monomeric transcription factor (i.e. a positive control
fusion protein containing one or more copies of both the DBD and TAD) that drives reporter gene
expression independent of a protein kinase can readily identify nonspecific inhibitors, or confirm
inhibitor specificity. Test compounds identified as inhibitors by the method of this invention may
be further evaluated for binding activity with respect to one or more additional PBDs of interest,
or with respect to additional proteins containing the domain(s), using various approaches, a
number of which are well known in the art.

The counterscreen assays may include all of the core features of the above-described general
assay of this invention, but the fusion proteins are altered to determine whether the inhibitor
binds to other ligands or other domains. The readout is independent of the particular PBD
and peptide ligand used in the original screen. Such counterscreens may be repeated as described to
establish specificity. Any nonspecific inhibitors (e.g., inhibitors of RNA transcription or protein
synthesis) can thereby be identified.

For instance, such identified inhibitors may be evaluated for activity as competitive
inhibitors of the binding of an SH2 domain with a phosphorylated ligand thereto (e.g. Pawson, US
Patent No. 5,352,660 (4 October 1994)). As another example, the counterscreen assay may use a
control cell containing and capable of expressing the reporter gene and its associated control
elements and a gene encoding a control fusion protein. The control fusion protein contains both the
DBD and the TAD which are separately present in the respective DNA binding and transcription
activator fusion proteins previously described. Alternatively, two fusion proteins may be used,
each containing one half of a binding pair with different binding specificities than those found
for the PBD and the protein-protein interaction assayed as described above. The failure of the test
compound to significantly alter the level of expression of the reporter gene in such a counterscreen
confirms its activity as an inhibitor of the PBP interaction.

Another counterscreen assay useful in evaluating test compounds shown to be inhibitors by
the method of this invention for binding to one or more additional PBDs of interest is surface
plasmon resonance (BIAcore®) technology (see, e.g., Panayotou et al., Mol. Cell. Biol.
3567-3576 (1993)).

In another counterscreen, a plasmid that drives the expression of a nuclear located tyrosine
phosphatase linked to the estrogen receptor ligand binding domain can be introduced into the cells.
A number of proteins have had their function rendered steroid hormone-dependent by fusion to the
steroid hormone receptor ligand binding domain. The hormone binding domain of steroid hormone
receptors acts as a molecular switch. In the absence of cognate ligand, the fusion protein is inactive.
However, hormone binding leads to the very rapid (within seconds or minutes) "gain of function" of
the chimeric protein. Heterologous proteins made hormone-dependent for function include both
enzymes and transcription factors. Thus, with the addition of estrogen, detectable protein
production would be curtailed, due to the dephosphorylation of the peptide ligand. Such cells could
be used to screen for phosphatase inhibitors that block the reduction of reporter production that
would occur when the cells are exposed to estrogen.

In a further possible counterscreen, compounds may be tested in an assay as described
herein, but replacing the PBD and peptide ligand domains with SH3 and SH3 ligand domains as
described in WO 95/24419. Compounds should not be active unless they inhibit a non PBP-related
component of the assay system or also inhibit SH3-mediated protein-protein interactions.

The inhibitors identified in the assay system of this invention can be further evaluated by
conventional methods for possible therapeutic applications, toxicological and pharmacological
activity. For example, test compounds so identified as inhibitors may further be evaluated for
activity in inhibiting cellular or other biological events mediated by a pathway involving the
PBD-based interaction of interest using a suitable cell-based assay or an animal model. Cell-based
assays and animal models suitable for evaluating inhibitory activity of a test compound with
respect to a wide variety of cellular and other biological events are known in the art. New assays
and models are regularly developed and reported in the scientific literature.

By way of nonlimiting example, compounds which bind to an SH2 domain involved in the
transduction of a signal leading to asthma or allergic episodes may be evaluated in a mast cell or
basophil degranulation assay. The inhibitory activity of a test compound identified as an SH2
inhibitor by the method of this invention with respect to cellular release of specific mediators such
as histamine, leukotrienes, hormonal mediators and/or cytokines, as well as its biological activity
with respect to the levels of phosphatidylinositol hydrolysis or tyrosine phosphorylation, can be
characterized with conventional in vitro assays as an indication of biological activity. [See, e.g.,
Edward L. Barsumian et al., Eur. J. Immunol., 11:317-323 (1981); M. J. Forrest, Biochem.
Pharmacol., 42:1221-1226 (1991) (measuring N-acetyl-beta-glucosaminidase from activated
neutrophils); and V. M. Stephan et al., J. Biol. Chem., 267:5434-5441 (1992)].

For example, histamine release can be measured by a radioimmunoassay using a kit available
from AMAC Inc. (Westbrook, ME). One can thus evaluate the biological activity of inhibitors
identified by the method of this invention and compare them to one another and to known active
compounds or clinically relevant compounds which can be used as positive controls.

Generally speaking, in such assays IC50 scores of 150-300 µM are considered of interest,
scores of 50-150 µM are considered good, and scores below about 50 µM are of high interest.
Prior to in vivo models, inhibitors identified by this invention may also be tested in an ex vivo
assay for their ability to block antigen-stimulated contraction of sensitized guinea pig tracheal
strip tissue. Activity in this assay has been shown to be useful in predicting the efficacy of
potential anti-asthma drugs.
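
The IC50 ranges stated above can be expressed as a small lookup. The Python sketch below is illustrative only: the function name is hypothetical, and the handling of the boundary values (exactly 50 or 150 µM) and of values above 300 µM is an assumption, since the text gives only approximate ranges.

```python
def rank_ic50(ic50_micromolar):
    """Map an IC50 value (in micromolar) onto the interest tiers named in the text.

    Assumed boundary handling: 50 counts as "good", 150 counts as
    "of interest", and anything above 300 falls outside the stated ranges.
    """
    if ic50_micromolar <= 0:
        raise ValueError("IC50 must be positive")
    if ic50_micromolar < 50:
        return "high interest"
    if ic50_micromolar < 150:
        return "good"
    if ic50_micromolar <= 300:
        return "of interest"
    return "outside the stated ranges"
```

For instance, a compound with an IC50 of 20 µM in a degranulation assay would fall in the "high interest" tier, while one at 200 µM would be "of interest".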

Numerous animal models of asthma have been developed and can be used [for reviews, see
Larson, "Experimental Models of Reversible Airway Obstruction", in THE LUNG, Scientific
Foundations, Crystal, West et al. (eds.), Raven Press, New York, pp. 953-965 (1991); Warner
et al., Am. Rev. Respir. Dis., 141:253-257 (1990)]. Species used in animal models of asthma
include mice, rats, guinea pigs, rabbits, dogs, sheep and primates. Other in vivo models available
are described in Cross et al., Lab. Invest., 63:162-170 (1990); and Koh, et al., Science,
256:1210-1213 (1992).

By way of further example, compounds identified as PBP inhibitors by the method of this
invention which bind to a PBD involved in the transduction of a signal involved in the initiation,
maintenance or spread of cancerous growth may be evaluated in relevant conventional in vitro and
in vivo assays. See, e.g., Ishii et al., J. Antibiot., XLII:1877-1878 (1989); and US Patent
5,206,249 (issued 27 April 1993).

VI. Uses of Inhibitors Identified by This Invention

Inhibitors identified by this invention may be used as biological reagents in assays as
described herein for functional classification of a PBD of a particular protein, particularly a newly
discovered protein. Families or classes of PBD-bearing proteins may now be defined functionally,
with respect to ligand specificity. Moreover, inhibitors identified by this invention can be used to
inhibit the occurrence of biological events resulting from molecular interactions mediated by a
PBP. Inhibiting such interactions can be useful in research aimed at better understanding the
biology of PBP-mediated events.

Such binding or blocking agents would be useful, for example, in the diagnosis, prevention or
treatment of conditions or diseases resulting from cellular processes mediated by a PBP
interaction. For example, a patient can be treated to prevent the occurrence or progression of
osteoporosis or to reverse its course by administering to the patient in need thereof an SH2 binding
or blocking agent which selectively binds Src SH2.

There are many other conditions for which phosphopeptide binding or blocking agents may be
useful therapeutically, including, e.g., breast cancer where the SH2 domain-containing proteins
Src, PLCgamma and Grb7 have been implicated. Other relevant conditions include prostate cancer,
in which case targeting Grb2, PLCg, and PI3K, all of which contain SH2 domains, may be useful in
treatment or prevention of the disease. Inhibition of the interaction of Grb2 or Abl SH2 domains
with Bcr-abl may be useful to treat chronic myelogenous leukemia (CML) or acute myelogenous
leukemia (AML).

Still other relevant applications of a PBP inhibitor would be to prevent interferon-, growth
factor-, or cytokine-mediated diseases (e.g. inflammatory diseases) by targeting the PBDs of STAT
proteins. Agents that block the SH2 domains of ZAP-70, which is believed to be involved in
activation of T-cells, would be useful in the treatment of autoimmune diseases. An inhibitor that
blocks one or both SH2 domains of ZAP-70 would also be useful as an immunosuppressant to
prevent rejection of skin and organ transplants.

By virtue of the capacity to inhibit protein-protein interactions required for cellular events
of pharmacologic importance, PBD/peptide ligand interaction inhibitors identified by the method
may be used in pharmaceutical compositions and methods for treatment or prevention in a subject
in need thereof. Such inhibitors can be used to treat or prevent the diseases or their pathological
effects mediated by such interactions.

For example, drugs that completely block one of the two ZAP SH2 domains should effectively
prevent ZAP from associating with the activated TCR and thus block T cell activation. A ZAP
antagonist or inhibitor would specifically inhibit T cells and avoid the toxicity of the currently
used immunosuppressive drugs, FK506 and cyclosporin, which target the more ubiquitously
expressed protein, calcineurin. Since calcineurin is required for cellular activities in several
tissues in addition to T cells, cyclosporin and FK506 cause side effects in the kidney and central
nervous system which limit their application largely to patients with organ transplant rejection.

VII. Pharmaceutical Compositions and Methods

A. Compositions

Inhibitors identified by this invention can be formulated into pharmaceutical compositions
containing a therapeutically (or prophylactically) effective amount of the inhibitor in admixture
with a pharmaceutically acceptable carrier and/or other excipients (i.e., pharmaceutically
acceptable organic or inorganic carrier substances suitable for parenteral administration) using
conventional materials and means. Such a carrier includes but is not limited to saline, buffered
saline, dextrose, water, glycerol, ethanol, and combinations thereof. The carrier and composition
can be sterile. The formulation should suit the mode of administration.

The composition, if desired, can also contain minor amounts of wetting or emulsifying agents,
or pH buffering agents. The composition can be a liquid solution, suspension, emulsion, tablet, pill,
capsule, sustained release formulation, or powder. The composition can be formulated as a
suppository, with traditional binders and carriers such as triglycerides. Oral formulation can
include standard carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium
stearate, sodium saccharine, cellulose, magnesium carbonate, etc.

In a specific embodiment, the composition is formulated in accordance with routine 
procedures as a pharmaceutical composition adapted for intravenous administration to human 
beings. Typically, compositions for intravenous administration are solutions in sterile isotonic 
aqueous buffer. Where necessary, the composition may also include a solubilizing agent and a local 
anesthetic to ease pain at the site of the injection. Generally, the ingredients are supplied either
separately or mixed together in unit dosage form, for example, as a lyophilized powder or water-free
concentrate in a hermetically sealed container such as an ampoule or sachette indicating the
quantity of active agent. Where the composition is to be administered by infusion, it can be 
dispensed with an infusion bottle containing sterile pharmaceutical grade water or saline. Where 
the composition is administered by injection, an ampoule of sterile water for injection or saline 
can be provided so that the ingredients may be mixed prior to administration. 

Topical compositions include a pharmacologically acceptable topical carrier, such as a gel, an
ointment, a lotion, or a cream, which includes, without limitation, such carriers as water,
glycerol, alcohol, propylene glycol, fatty alcohols, triglycerides, fatty acid esters, or mineral oils.
Other topical carriers include liquid petroleum, isopropyl palmitate, polyethylene glycol, ethanol
(95%), polyoxyethylene monolaurate (5%) in water, or sodium lauryl sulfate (5%) in water.
Other materials such as anti-oxidants, humectants, viscosity stabilizers, and similar agents may 
be added as necessary. 

Materials and methods for producing the various formulations are well known in the art [see,
e.g., US Patent Nos. 5,182,293 and 4,837,311].

B. Methods 

The invention provides methods of treating, preventing and/or alleviating the symptoms
and/or severity of a disease or disorder referred to above by administration to a subject of a PBP
inhibitor in an amount effective therefor. The subject will be an animal, including but not limited
to animals such as cows, pigs, chickens, etc., and is preferably a mammal, and most preferably
human. By "mammals" is meant rodents such as mice, rats and guinea pigs as well as dogs, cats,
horses, cattle, sheep, nonhuman primates and humans. Such effective amounts can be readily
determined by evaluating the inhibitors identified by this invention in conventional assays
well-known in the art, including assays described herein.

Administration of such composition may be by any conventional route using appropriate
formulations as are well known in this art. Various delivery systems are known and can be used to
administer the inhibitor, e.g., encapsulation in liposomes, microparticles, microcapsules. One
mode of delivery of interest is via pulmonary administration. Other methods of introduction include
but are not limited to intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous,
intranasal, epidural, nasal and oral routes. The inhibitor may be administered by infusion or bolus
injection, by absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and
intestinal mucosa, etc.) and may be administered together with other biologically active agents.

Administration can be systemic or local. For treatment or prophylaxis of nasal, bronchial or
pulmonary conditions, preferred routes of administration are oral, nasal or via a bronchial aerosol
or nebulizer. In specific embodiments, it may thus be desirable to administer the inhibitor locally
to the area in need of treatment; this may be achieved by, for example, and not by way of limitation,
local infusion during surgery, topical application, by injection, by means of a catheter, by means
of a suppository, or by means of a skin patch or implant, said implant being of a porous,
non-porous, or gelatinous material, including membranes, such as sialastic membranes, or fibers.

Administration to an individual of an effective amount of the inhibitor can also be
accomplished topically by administering the compound(s) directly to the affected area of the skin of
the individual. In certain instances, it is expected that the inhibitor may be disposed within devices
placed upon, in, or under the skin. Such devices include patches, implants, and injections which
release the compound into the skin, by either passive or active release mechanisms.

The amount of the inhibitor which will be effective in the treatment or prevention of a
particular disorder or condition will depend on the nature of the disorder or condition, and can be
determined by standard clinical techniques. In addition, in vitro or in vivo assays may optionally be
employed to help identify optimal dosage ranges. Effective doses may be extrapolated from
dose-response curves derived from in vitro or animal model test systems. For example, a typical
effective dose of the inhibitor is in the range of about 0.01 to about 50 mg/kg, preferably about
0.1 to about 10 mg/kg of mammalian body weight, administered in single or multiple doses.
Generally, the inhibitor may be administered to patients in need of such treatment in a daily dose
range of about 1 to about 2000 mg per patient.
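
The relationship between the weight-based ranges and a per-subject amount is simple multiplication, sketched below in Python. This is arithmetic only, under stated assumptions: the function name and the 70 kg body weight in the usage note are hypothetical, and the default limits are simply the "about 0.01 to about 50 mg/kg" figures quoted above.

```python
def dose_range_mg(body_weight_kg, low_mg_per_kg=0.01, high_mg_per_kg=50.0):
    """Convert a mg/kg dose range into total milligrams for one body weight.

    Defaults mirror the broad range quoted in the text; pass 0.1 and 10.0
    for the narrower preferred range. Returns a (low, high) tuple in mg.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("low end of the range exceeds the high end")
    return (body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg)
```

For a hypothetical 70 kg subject, `dose_range_mg(70.0, 0.1, 10.0)` gives roughly 7 to 700 mg, which lies within the stated daily range of about 1 to about 2000 mg per patient.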

The precise dosage level of the inhibitor, as the active component(s), should be determined by
the attending physician or other health care provider and will depend upon well known factors,
including the phosphopeptide binding interaction under consideration, the route of administration,
and the age, body weight, sex and general health of the individual; the nature, severity and clinical
stage of the disease; and the use (or not) of concomitant therapies.

C. Kits

The invention also provides a pharmaceutical pack or kit comprising one or more containers
filled with one or more of the ingredients of the pharmaceutical compositions of the invention.
Optionally associated with such container(s) can be a notice in the form prescribed by a
governmental agency regulating the manufacture, use or sale of pharmaceutical or biological
products, which notice reflects approval by the agency of manufacture, use or sale for human
administration.

Other components such as physiologically acceptable surfactants (e.g., glycerides),
excipients (e.g., lactose), carriers, and diluents may also be included.

The following examples illustrate various aspects of this invention. These examples do not 
limit the scope of this invention which is defined by the appended claims. 



Examples 

EXAMPLE 1: PLASMIDS 

A. Plasmid pYSZ, a GAL4 (1-147)-vSrc kinase-Zeta ITAM yeast expression vector.

Plasmid pGTB9 (Clontech Laboratories) was used for the expression of GAL4 (1-147)-vSrc
kinase-Zeta ITAM in yeast. This plasmid has a 2 micron origin of replication and sequences for
Amp (bacteria) or Trp (yeast) selection. pGTB9 also has ADH1 regulatory sequences that direct the
expression of the GAL4 DBD fusion proteins. Downstream of GAL4 coding sequences are multiple
restriction endonuclease cloning sites.

The vSrc gene (Genbank Accession #J02342) kinase domain was introduced downstream of
GAL4 sequences in pGTB9 as an EcoRI/BamHI fragment after PCR amplification. The 5' and 3'
oligonucleotides, respectively, utilized for PCR were: (EcoRI end) cgGAATTCtccaagccccagaccca [SEQ
ID NO:1] and (BamHI end) gcGGATCCctcagcgacctccaaca [SEQ ID NO:2]. These oligos amplify
residues 248-526 of vSrc. PCR generated vSrc kinase DNA was sequenced after cloning to check
the fidelity of the amplification process.
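
The primers in this Example carry their cloning sites in upper case near the 5' end. A short script can confirm that each primer contains the restriction site it is labeled with. This is a minimal sketch: the helper name `find_sites` and the site table are introduced here for illustration; the recognition sequences are the standard ones for EcoRI, BamHI, SalI and SpeI.

```python
# Standard recognition sequences for the enzymes named in Example 1.
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "SalI": "GTCGAC",
    "SpeI": "ACTAGT",
}


def find_sites(primer):
    """Return {enzyme: 0-based position} for each site present in the primer.

    Case is ignored, so the mixed-case notation used in the text can be
    pasted in directly. Hypothetical helper, not part of the specification.
    """
    seq = primer.upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


# SEQ ID NO:1 and SEQ ID NO:2 as printed above.
primers = {
    "SEQ ID NO:1 (EcoRI end)": "cgGAATTCtccaagccccagaccca",
    "SEQ ID NO:2 (BamHI end)": "gcGGATCCctcagcgacctccaaca",
}

for label, primer in primers.items():
    print(label, find_sites(primer))
```

The same check can be applied to any of the other oligonucleotides listed in Example 1 before ordering or re-keying them.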

Sequences encoding human Zeta chain cytoplasmic residues (Genbank Accession #J04132,
residues 52-164), cloned into pET23b (Invitrogen), were excised using NdeI and HindIII. The
restriction fragment encoding Zeta residues was blunt ended using T4 DNA polymerase and ligated to
blunt ended SalI digested pGTB9-vSrc Kinase DNA. The recombinant DNA was digested with
restriction endonucleases to identify a clone which contained Zeta coding sequences in the proper
orientation.

B. pYAZ22, a GAL4 activation domain-tandem ZAP SH2 yeast expression vector. 

Plasmid pGAD424 (Clontech Laboratories) was used for the expression of GAL4 activation
domain-tandem ZAP SH2 fusion protein in yeast. pGex2KT-huZAP(1-259), containing residues
1-259 of human ZAP-70 (Genbank Accession #L05148), was digested with BamHI and blunt ended
using T4 DNA polymerase. After the addition of EcoRI linkers (New England Biolabs, catalog
#1020), nucleotide sequence encoding the tandem SH2 domains of ZAP (residues 1-259) was
isolated as an EcoRI fragment and cloned into the EcoRI site of pGAD424.

The recombinant DNA was digested with restriction endonucleases to identify a clone which
contained ZAP SH2 domain coding sequences in the proper orientation.

C. pYerSZ, an estrogen regulated GAL4 (1-147)-vSrc kinase-Zeta ITAM yeast expression
vector.

Human estrogen receptor ligand binding domain (l.b.d.) coding sequences (residues
282-595) were amplified by PCR from pHuER (Genbank Accession #M12674) using the 5' and 3'
oligonucleotide primers, respectively, (EcoRI end) cgGAATTCtctgctggagacatgagagct [SEQ ID NO:3]
and (EcoRI end) cgGAATTCgactgtggcagggaaaccct [SEQ ID NO:4]. After digesting the amplified DNA
using restriction endonuclease EcoRI, estrogen receptor l.b.d. sequences were cloned into EcoRI cut
pYSZ (Example 1A). A recombinant clone was isolated that had estrogen receptor sequences in the
right orientation. This new plasmid, pYerSZ, directs the expression of GAL4 DBD-estrogen
receptor l.b.d.-vSrc kinase-Zeta ITAM fusion protein in yeast.

D. pMSZ, a GAL4 (1-147)-vSrc kinase-Zeta ITAM mammalian cell expression vector.

pBXG1, a pECE72-based vector [Sadowski, I. and Ptashne, M., Nucl. Acids Res. 17:7539
(1989)], was used for the expression of GAL4 DBD fusion proteins in mammalian cells. pECE72
[Ellis, L. et al., Cell 45:721-732 (1986)] has the SV40 virus origin of replication, SV40 early
promoter and SV40 polyadenylation regulatory sequences. GAL4 (1-147) and multiple cloning
sites C-terminal to GAL4 coding sequences were obtained from pSKGAL147 [Kakidani, H. and
Ptashne, M., Cell 52:161-167 (1988)] as a HindIII/XbaI fragment and inserted into
HindIII/XbaI cut pECE72 to generate pBXG1.

The sequences encoding vSrc kinase-Zeta ITAM were excised from pYSZ (Example 1A) as an
EcoRI/BglII fragment and cloned into EcoRI/BamHI digested pBXG1 to generate plasmid pMSZ,
capable of directing the expression of GAL4 (1-147)-vSrc kinase-Zeta ITAM in mammalian cells.

E. pMAZ22, a Herpes Virus VP16 activation domain-tandem ZAP SH2 domain mammalian
cell expression vector.

pMVN1 (Ivan Sadowski, University of British Columbia, Vancouver, CA) was used for the
production of transcription activation domain-tandem ZAP SH2 domain fusion proteins in
mammalian cells. pMVN1 contains the SV40 early promoter, HSV TK translational leader
sequence, SV40 nuclear localization sequences and VP16 activation domain residues followed by
multiple cloning sites for the construction of novel fusion proteins. The sequences encoding the
tandem SH2 domains of ZAP (residues 1-259) were excised from pYAZ22 (Example 1B) as an
EcoRI fragment and cloned into EcoRI digested pMVN1 in the proper orientation.

F. pMerSZ, a GAL4 (1-147)-estrogen receptor l.b.d.-vSrc kinase-Zeta ITAM
mammalian cell expression vector.

Human estrogen receptor l.b.d. coding sequences were prepared as described in 1C above.
After digesting the amplified DNA using EcoRI, estrogen receptor l.b.d. sequences were cloned
into EcoRI-cut pMSZ (Example 1D). A recombinant clone was isolated that had estrogen receptor
sequences in the right orientation. This new plasmid, pMerSZ, directs the expression of GAL4
DBD-estrogen receptor l.b.d.-vSrc kinase-Zeta ITAM fusion protein in mammalian cells.

G. pYSB, a GAL4 (1-147)-vSrc kinase-Beta ITAM yeast expression vector.

pGTB9-vSrc Kinase DNA (see Example 1A) was engineered for the production of GAL4
DBD-vSrc kinase-Beta ITAM chain fusion protein. Beta sequences (Genbank Accession
#S21154) were amplified from pCD8-beta [Rivera, V. and Brugge, J., Mol. Cell. Biol.
15:1582-1590 (1995)] using the 5' and 3' oligonucleotides, respectively:
(BamHI end) gcGGATCCggagctggggaagaactca [SEQ ID NO:5] and (SalI end)
acgcGTCGACttataaatcaatgggaggag [SEQ ID NO:6]. The resulting amplified DNA, after digestion with
BamHI and SalI, was cloned into BamHI/SalI digested pGTB9-vSrc Kinase.

H. pYerSB, a GAL4 (1-147)-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM yeast
expression vector.

The same human estrogen receptor l.b.d. coding sequences were prepared and amplified by
PCR as described in 1C above and cloned into EcoRI cut pYSB (Example 1G). A recombinant clone
was isolated that had estrogen receptor sequences in the right orientation. This new plasmid,
pYerSB, directs the expression of GAL4 DBD-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM
fusion protein in yeast.

I. pYAS32, a GAL4 activation domain-Src SH3/SH2 yeast expression vector.

pGAD424 (Clontech Laboratories) was used for the production of transcription activation
domain-human Src SH3/SH2 domain fusion proteins in yeast cells. Sequences encoding the Src
SH3/SH2 domain (residues 84-249) [Tanaka, A. and Fujita, D.J., Mol. Cell. Biol. 6:3900-3909
(1986)] were amplified by PCR using the 5' and 3' oligonucleotides, respectively:
(EcoRI end) cctcacGAATTCggtggagtgaccacctttgtggcc [SEQ ID NO:7] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:8]. After restriction endonuclease digestion using
EcoRI and BamHI, the PCR product was ligated to EcoRI/BamHI digested pGAD424.

J. pYAS2, a GAL4 activation domain-Src SH2 yeast expression vector.

pGAD424 was used for the production of transcription activation domain-human Src SH2
domain fusion proteins in yeast cells. Sequences encoding the Src SH2 domain (residues
144-249) (see Tanaka, A. and Fujita, D.J. (1986), cited above) were amplified by PCR using the
5' and 3' oligonucleotides, respectively:
(EcoRI end) cctcacGAATTCggcgactccatccaggctgaggag [SEQ ID NO:9] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:10].

After digestion with EcoRI and BamHI, the PCR fragment was cloned into EcoRI/BamHI
digested pGAD424.

K. pMSB, a GAL4 (1-147)-vSrc kinase-Beta ITAM chain mammalian cell expression
vector.

pYSB (Example 1G) was cut with restriction endonucleases EcoRI and BglII and the DNA
fragment encoding vSrc kinase-Beta ITAM chain residues was isolated and ligated to EcoRI/BamHI
digested pBXG1 (Example 1D).

L. pMerSB, a GAL4 (1-147)-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM chain
mammalian cell expression vector.

pMSB (Example 1K) was cut with restriction endonuclease EcoRI. An EcoRI DNA fragment
containing the human estrogen receptor l.b.d. coding sequences, prepared as described in 1C above,
was ligated to EcoRI digested pMSB. A recombinant clone was isolated that had estrogen receptor
sequences in the right orientation. This new plasmid, pMerSB, directs the expression of GAL4
DBD-estrogen receptor l.b.d.-vSrc kinase-Beta ITAM fusion protein in mammalian cells.

M. pMAS32, a Herpes Virus VP16 activation domain-human Src SH3/SH2 domain fusion
protein mammalian cell expression vector.

pMVN1 (Ivan Sadowski, Univ. B.C., Vancouver, CA) was used for the production of
TAD-human Src SH3/SH2 domain fusion proteins in mammalian cells. Sequences encoding the Src
SH3/SH2 domain (residues 84-249; see above) were amplified by PCR using the 5' and 3'
oligonucleotides, respectively:

(EcoRI end) cctcacGAATTCggtggagtgaccacctttgtggcc [SEQ ID NO:11] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:12].

After restriction endonuclease digestion using EcoRI and BamHI, the PCR product was ligated
to EcoRI/BamHI digested pMVN1.

N. pMAS2, a Herpes Virus VP16 activation domain-human Src SH2 domain fusion protein
mammalian cell expression vector.

Sequences encoding the Src SH2 domain (residues 144-249) [Tanaka et al., supra] were
amplified by PCR using the 5' and 3' oligonucleotides, respectively:
(EcoRI end) cctcacGAATTCggcgactccatccaggctgaggag [SEQ ID NO:13] and (BamHI end)
ccactcGGATCCgccggggcacacggtggtgaggc [SEQ ID NO:14].

After digestion with EcoRI and BamHI, the PCR fragment was cloned into EcoRI/BamHI
digested pMVN1 (Example 1M).

O. pMerVP, a GAL4 DBD-estrogen receptor l.b.d.-VP16 TAD fusion protein mammalian
cell expression vector.

An EcoRI DNA fragment containing the human estrogen receptor l.b.d. coding sequences
prepared as described in 1C above was ligated to EcoRI digested pBXG1 (Example 1D). Clones
containing the estrogen receptor sequences in the right orientation (designated pMer) were
identified by restriction endonuclease analysis. VP16 TAD residues were amplified using template
plasmid pMVN1 (Example 1M) and the 5' and 3' oligonucleotides, respectively: (BamHI end)
cgGGATC^gcctgggggacgagctc [SEQ ID NO:15] and (SpeI end) ggACTAGTcccaccgtactcgtcaat [SEQ ID
NO:16]. After digestion with BamHI and SpeI, the PCR fragment was cloned into BamHI/SpeI
digested pMer.

P. pMAF2, a Herpes Virus VP16 activation domain-human Fyn SH2 domain fusion protein
mammalian cell expression vector.

Sequences encoding the Fyn SH2 domain (residues 149-251) [Genbank M14333] were
amplified by PCR using 5' and 3' oligonucleotides such that the Fyn SH2 domain coding
sequences contained EcoRI and BamHI ends. After digestion with EcoRI and BamHI, the PCR
fragment was cloned into EcoRI/BamHI digested pMVN1 (Example 1M).

EXAMPLE 2: STABLE MAMMALIAN CELL LINE 

A variety of cell lines were established by introducing selected plasmid DNA, described in 
Example 1 into mammalian cells using lipofectin reagent (Gibco BRL) as suggested by the 
manufacturer. 

A selected host cell line is the mammalian HT1080 fibrosarcoma cell line [American Type
Culture Collection Accession No. CRL7951]. These cells are prepared by lipofection using a SEAP
reporter construct comprising

( a ) a GAL4 responsive/dependent promoter (5 GAL4 binding sites linked to the core IL-2
promoter, the "G5-IL2" promoter), which is placed upstream of
( b ) secreted alkaline phosphatase (SEAP) coding sequences; and

a plasmid containing the hygromycin gene.

This cell line, G5-IL2 HT1080, was employed in a variety of the examples below.

EXAMPLE 3: TWO-HYBRID MECHANISM-BASED IN VIVO ASSAY - ZETA
ITAM-TANDEM ZAP SH2-DEPENDENT TWO-HYBRID FORMATION IN MAMMALIAN
CELLS

In a prototypic example of an assay of this invention, a two-hybrid interaction recapitulates a
critical event that occurs during T-cell activation, namely the SH2 domain-dependent binding of
the ZAP-70 tyrosine kinase to the TCR zeta ITAM chain of the T-cell receptor.

The G5-IL2 HT1080 fibrosarcoma cell line of Example 2 is transfected with two effector
plasmids by lipofection:

1 ) pMSZ (Example 1D) directs the expression of a novel fusion protein that consists of
GAL4 DBD residues, the vSrc kinase domain and the human zITAM chain. This fusion protein should
bind to DNA and have tyrosine kinase activity. The clustering of these molecules, localized at the
G5-IL2 promoter, should help promote efficient zITAM phosphorylation.

2 ) pMAZ22 (Example 1E) drives the expression of the tandem SH2 domains of ZAP, fused
to the herpes virus VP16 TAD.

For use as controls and comparisons, HT1080 cells were also transfected with carrier DNA
only, or pMSZ only, or pMS (GAL4 DBD-vSrc kinase), or pMS and pMAZ22.

The transfected host cells are cultured in 6 well dishes. Forty-eight hours after
transfection, the amount of SEAP released into the media was determined using a fluorescence assay
[J. Berger et al., Gene 66:1-10 (1988)]. The extent of two-hybrid formation (i.e., the effect on
transcription and the amount of ZAP SH2-zITAM interaction) is ascertained by measuring the
amount (i.e., the activity) of the SEAP reporter released into the media.

The results of this assay are shown in FIG. 2. Cells transfected with carrier DNA do not
release SEAP into the media (FIG. 2, col. 1). GAL4 DBD-vSrc kinase-Zeta ITAM fusion protein,
when expressed in cells, is not a competent transcriptional activator (FIG. 2, col. 2). However,
when, in addition, VP16-tandem ZAP SH2 fusion protein is made in these cells, transcription is
induced due to two-hybrid formation and the cells secrete copious amounts of SEAP into the culture
media (FIG. 2, col. 3). Thus, the requirements for the induction of SEAP production in
HT1080 fibrosarcoma cells are consistent with the interaction of tyrosine phosphorylated Zeta ITAM
and the tandem ZAP SH2 domains. The induction of SEAP production requires that both of the fusion
proteins described above be expressed. The recognition and binding of tyrosine phosphorylated
zITAM by the ZAP SH2 domains effectively bring the VP16 TAD to the promoter, thereby inducing
transcription of the SEAP reporter gene.

Two-hybrid formation is Zeta ITAM chain-dependent (compare FIG. 2, cols. 3 and 5).
Furthermore, GAL4 DBD fusion proteins which lack the vSrc kinase domain (GAL4 DBD-Zeta
ITAM), or which lack the Zeta chain, i.e., pMS (GAL4 DBD-vSrc kinase), fail to induce SEAP
production when produced in cells alone or with pMAZ22, i.e., the VP16 TAD-tandem ZAP SH2
fusion protein (see FIG. 2, cols. 4 and 5). See also cols. 1 and 2 for controls. In addition to the
results shown in FIG. 2, cells that contain plasmids that encode GAL4 DBD and VP16 TAD-tandem
ZAP SH2, or GAL4 DBD-vSrc kinase-Zeta ITAM and VP16 TAD, fail to produce SEAP.

Hence, in a heterologous cell line the tandem ZAP-SH2:zITAM interaction that normally
occurs in T-cells has been reproduced and a simple, sensitive means to monitor the extent to which
these two molecules interact has been presented by this invention.

Using SEAP production as an indicator of the interaction of these two molecules, it is possible
to identify compounds that can specifically inhibit ZAP SH2 domain-dependent interactions. When
the host cells are cultured, they are exposed to test compounds. The extent to which the test
compounds successfully interrupt the interaction between the ZAP SH2 and the zITAM indicates the
presence of an SH2 inhibitor.

With the assay set up as described, one potential drawback when screening for molecules that
inhibit ZAP SH2-dependent interactions is that the constitutive expression of the interacting 
components of the two-hybrid screen would result in constitutive ZAP SH2-Zeta ITAM complex 
formation. Hence, this embodiment of an assay identifies only compounds that efficiently disrupt 
two-hybrid formation. 

EXAMPLE 4: TRANSCRIPTIONALLY INDUCED TWO-HYBRID FORMATION: AN 
ALTERNATIVE TWO-HYBRID ASSAY FOR MAMMALIAN CELLS SCREEN FOR 
COMPOUNDS THAT INHIBIT ZAP SH2 DOMAIN FUNCTION 

In another embodiment of the assay of this invention, cells are exposed to potential inhibitory
test compounds and then the interaction of proteins whose association is to be inhibited is induced.
This embodiment of the assay depends on use of plasmids for the production of both the Zeta ITAM
and tandem ZAP SH2 fusion proteins in the cells (as described in Example 2), which have inducible
transcriptional regulatory sequences, rather than constitutive (i.e., SV40 virus based)
transcriptional regulatory sequences.

An example of an inducible transcriptional regulatory sequence includes the mouse mammary 
tumor virus (MMTV) dexamethasone inducible promoter. Other known regulatory sequences are 
useful here. Thus, if, for example, the sequences encoding GAL4 DBD-vSrc kinase-Zeta ITAM 
fusion protein were linked to the MMTV inducible regulatory sequences, fusion protein production 
would be dexamethasone-dependent. The reporter cell line is then exposed to potential inhibitory 
compounds and two-hybrid formation is induced using dexamethasone. In this embodiment Zeta 
ITAM-tandem ZAP SH2 two-hybrid formation occurs if SH2 domain(s) binding is not affected by 
inhibitory compounds. 

The mammalian G5-IL2 HT1080 fibrosarcoma cell line (Example 2) is transfected with two
effector plasmids by lipofection:

( 1 ) One plasmid constitutively drives the expression of TAD-ZAP tandem SH2.

( 2 ) The other plasmid encodes a fusion protein, GAL4 DBD-vSrc kinase domain-zITAM,
expressed using the dexamethasone (Dex) inducible MMTV promoter.

These transfected mammalian cells are seeded in 96 well dishes and serial dilutions of test
compounds are added to the wells. Dex is added to the media. Phosphorylated zITAM fusion protein
accumulates in cells upon the addition of Dex. After a 24 hour incubation, media is removed from
the 96 well dish and placed into a second 96 well dish to perform a chromogenic or fluorescence
assay for SEAP. The SEAP assay is measured using a 96 well plate reader. The amount of SEAP
released into the media can be quantitated by using either a chromogenic or fluorescence assay.

By binding of the fusion protein to the ZAP SH2 domains, the TAD is brought to the SEAP
promoter and SEAP is produced. Compounds will be present in cells to bind to ZAP SH2 domains
before Dex-induced production of phosphorylated zITAM occurs. Compounds that block ZAP SH2
function reduce SEAP production. Compounds that reduce SEAP production are analyzed using cells
expressing SEAP in a ZAP SH2-independent manner to confirm that the compound is having a
specific effect on ZAP SH2-zITAM complex formation.

EXAMPLE 5: TWO-HYBRID ASSAY DETECTS BETA ITAM-SRC SH2 
DOMAIN-DEPENDENT TWO-HYBRID FORMATION IN MAMMALIAN CELLS. 

A two-hybrid assay useful for detecting Src tyrosine kinase SH2 domain-dependent 
interactions is performed as follows. 

G5-IL2 HT1080 cells (Example 2) were transfected with the following plasmid DNAs:

( a ) pMSB (Example 1K; GAL4 DBD-vSrc kinase-Beta ITAM);

( b ) pMSB and pMAS2 (Example 1N; VP16-Src SH2); or

( c ) pMSB and pMAZ22 (Example 1E; VP16-tandem ZAP SH2).

These transfected mammalian cells are seeded in 96 well dishes. Forty-eight hours after
transfection, the amount of SEAP released into the media was determined using a fluorescence assay.

The results are shown in FIG. 6. Col. 1 illustrates that pMSB (GAL4 DBD-vSrc kinase-IgE
receptor Beta ITAM fusion protein) is not by itself a competent transcriptional activator.
However, when pMAS2 (VP16-Src SH2 fusion protein) is also present in cells, transcription is
induced (see col. 2). In contrast, VP16-tandem ZAP SH2 does not efficiently interact with GAL4
DBD-vSrc kinase-Beta ITAM fusion protein to induce SEAP production (see col. 3).

When pMAS2 was co-expressed with GAL4 DBD-vSrc kinase-Zeta ITAM in the HT1080
fibrosarcoma indicator cell line, SEAP production was not observed (data not shown). That is, if Beta
ITAM residues are replaced with the Zeta ITAM sequences, two-hybrid formation (SEAP production) is
not observed using these conditions.

Summarizing both Examples 4 and 5, two-hybrid formation appears to be ITAM and SH2
domain specific. The tandem ZAP SH2 domains bind to the Zeta ITAM (FIGS. 2 and 3), but not to the
Beta ITAM (FIG. 6, col. 3). The Src SH2 domain efficiently binds to the Beta ITAM (FIG. 6) but not
to Zeta ITAM residues. As observed when using the ZAP SH2 domains and Zeta ITAMs, Src SH2-
dependent two-hybrid formation can be rendered inducible via the use of the estrogen receptor
ligand binding domain residues, as discussed below.

EXAMPLE 6: POST-TRANSCRIPTIONALLY INDUCED TWO-HYBRID FORMATION: A
STEROID DEPENDENT TWO-HYBRID ASSAY FOR MAMMALIAN CELLS

A. Estrogen regulation of Zeta ITAM-tandem ZAP SH2 dependent two-hybrid formation in
mammalian cells (transient transfections)

The mammalian G5-IL2 HT1080 fibrosarcoma cell line (Example 2) is transfected with two
effector plasmids by lipofection:

1 ) pMerSZ (Example 1F) drives the production of GAL4 DBD-ER l.b.d.-vSrc kinase-Zeta
ITAM fusion protein in mammalian cells.

2 ) pMAZ22, VP16 TAD-tandem ZAP SH2 (Example 1E).

For use as controls and comparisons, the G5-IL2 HT1080 cells were also transfected with carrier
DNA, or with pMSZ only, or with pMSZ and pMAZ22, or with pMerSZ only.

Each set of these transfected mammalian cells is seeded in 6 well dishes and serial dilutions of
test compounds are added to the wells. Twenty-four hours after transfection, one duplicate dish of
cells was cultured in the presence of 10 nM estrogen.

Forty-eight hours after transfection, an aliquot of media from all of the dishes was removed
from the 6 well dish and placed into a 96 well dish to perform a chromogenic or fluorescence assay
for SEAP. FIG. 3 provides the results of this assay: The cell line transfected with carrier DNA is
reported in cols. 1 and 2; or with pMSZ (Example 1D) in cols. 3 and 4; or with both pMSZ and
pMAZ22 in cols. 5 and 6. Cols. 7 and 8 are the results of the cell line transfected only with plasmid
pMerSZ. Cols. 9 and 10 are results of the cell line transfected with both pMerSZ and pMAZ22, as
described above.

The addition of the estrogen receptor l.b.d. residues to the GAL4 DBD-vSrc kinase-Zeta ITAM
fusion protein makes two-hybrid formation estrogen-dependent (compare FIG. 3, cols. 9 and 10).
That is, in the absence of exogenously added estrogen, the fusion protein (GAL4 DBD-ER l.b.d.-
vSrc kinase-zITAM) is not functional. In fact, FIG. 3, col. 9 shows that only low levels of SEAP
are detected in the media. This is presumably due to the low levels of estrogen present in serum.
Low levels of SEAP are not observed if the cells are cultured in a defined synthetic media lacking
estrogen. When plasmids pMerSZ and pMAZ22 are expressed together in cells, two-hybrid
formation (reporter gene transcription) should not occur (col. 9), unless estrogen is present in
the media (col. 10). When these transfected cells were exposed to 10 nM estrogen, copious
amounts of SEAP were released into the media.

Estrogen did not markedly affect two-hybrid formation if the estrogen receptor l.b.d. was not 
present in either plasmid expressing fusion protein (cols. 5 and 6). Also, the estrogen receptor 
l.b.d. residues do not make the GAL4 DBD-vSrc kinase-Zeta ITAM fusion protein itself a 
transcriptional activator (FIG. 3. Cols. 7 and 8). 

SEAP induction was time-dependent; the majority of the SEAP was produced 12-18 hours
after exposure of the cells to estrogen (see FIG. 5 for analogous results with stably transfected
cells).

B. Estrogen regulation of Beta ITAM-Src SH2-dependent two-hybrid formation in
mammalian cells (transient transfection).

In a manner similar to that described above for the Zeta ITAM constructs, G5-IL2 HT1080
cells were transfected with the following plasmid DNA (in duplicate): pMerSB, or pMerSB and
pMAS2. Twenty-four hours after transfection, one dish of cells was cultured in the presence of 10
nM estrogen. Another twenty-four hours later, an aliquot of media from all of the dishes was
analyzed for SEAP activity.

As shown in FIG. 7, the presence (+) or absence (-) of estrogen is indicated, and the columns
represent cells transfected with pMerSB (Example 1L) [Cols. 1 and 2] or with pMerSB and pMAS2
(Example 1N) [Cols. 3 and 4]. The addition of the estrogen receptor l.b.d. residues to the GAL4
DBD-vSrc kinase-Beta ITAM fusion protein makes two-hybrid formation estrogen-dependent
(compare cols. 3 and 4). In contrast, the estrogen receptor l.b.d. residues do not make the GAL4
DBD-vSrc kinase-Beta ITAM fusion protein itself a transcriptional activator (cols. 1 and 2).

C. Stable cell line that constitutively expresses GAL4 DBD-estrogen receptor l.b.d.-Src
kinase-zITAM and ZAP-SH2 activation domains in mammalian cells.

G5-IL2 HT1080 cells that contain an integrated GAL4-dependent SEAP reporter gene
(Example 2) were transfected with pBabeNeo (neomycin selection). pMerVP (Example 1O), and
the two effector plasmids that drive the constitutive production of GAL4 DBD-estrogen receptor
l.b.d.-vSrc kinase-Zeta ITAM and VP16 TAD-tandem ZAP SH2, i.e., pMerSZ (Example 1F) and
pMAZ22 (Example 1E), respectively, were transfected into the cells with the neomycin selection plasmid.

After selection with both hygromycin and neomycin, cell lines were generated that contain the
reporter gene (hygromycin selection) and both effector plasmids (neomycin selection). Individual
colonies were isolated and expanded. One set of duplicate cells was exposed to 10 nM estrogen. Both
duplicate sets of cells were assayed for estrogen-induced (+inducer) SEAP activity. Those with
markedly induced SEAP production were expanded for further analysis.

Of the twelve colonies containing MerSZ and MAZ22 that were analyzed, five exhibited high
levels of SEAP activity after exposure to estrogen (FIG. 4). Two of these clones (clone A and clone
D) exhibited very high levels of estrogen-induced SEAP gene expression. The amount of SEAP in
the media was induced 200-fold (cell line #3) and 600-fold (cell line #6) after the addition of 10 nM
estrogen. Stable cell lines that contained estrogen-regulated MerVP were also obtained (Example
6D).
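
Fold induction, as used above, is the ratio of SEAP activity measured with estrogen to that measured without. The following is a minimal sketch of how clones might be ranked on that ratio; the clone names, the readings and the 100-fold cutoff are hypothetical values chosen for illustration and are not data from the figures.

```python
def fold_induction(seap_plus_estrogen, seap_minus_estrogen):
    """Ratio of induced to uninduced SEAP activity (same units for both)."""
    if seap_minus_estrogen <= 0:
        raise ValueError("uninduced reading must be positive to form a ratio")
    return seap_plus_estrogen / seap_minus_estrogen


def select_inducible(readings, min_fold):
    """Return clone names whose fold induction meets the cutoff, sorted by name.

    readings maps clone name -> (minus_estrogen, plus_estrogen).
    """
    return sorted(
        name
        for name, (minus, plus) in readings.items()
        if fold_induction(plus, minus) >= min_fold
    )


# Hypothetical plate-reader values (arbitrary units), not taken from FIG. 4.
example = {
    "clone_1": (1.0, 200.0),
    "clone_2": (2.0, 10.0),
    "clone_3": (0.5, 300.0),
}
print(select_inducible(example, min_fold=100.0))
```

Because the ratio is sensitive to a near-zero uninduced reading, background subtraction and replicate wells would normally be handled before this step.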

The time course of SEAP production by this stable cell line with inducible Zeta ITAM-tandem ZAP
SH2-dependent two-hybrid activity was studied as follows. Clone D (FIG. 4) was cultured in the
presence of 10 nM estrogen and, at various times after the addition of estrogen, culture media was
assayed for the presence of SEAP activity. As shown in FIG. 5, the bulk of SEAP production occurs
by 12 hours. Clone D cells do not produce appreciable amounts of SEAP unless estrogen is
exogenously added to the media (see FIG. 4). Also, the parent cells (G5-IL2 HT1080 cells) do not
produce SEAP when cultured in the presence of 10 nM estrogen for 24 hours (compare -E, +E
columns of FIG. 5).

D. Control Cell Line

When screening for compounds that block SH2-dependent two-hybrid formation (SEAP 
production), compounds that have nonspecific effects will also be identified. Such compounds 
include those that are cytotoxic, those that affect transcription, translation or secretion, those that 
affect GAL4 DNA binding or the estrogen receptor ligand binding domain. Hence, compounds that are 
identified using the SH2 dependent assay are tested using a cell line that constitutively expresses 
GAL4 DBD-estrogen receptor l.b.d.-VP16 fusion protein. 

The results with one clone (cell line 6), containing an inducible GAL4-estrogen receptor
l.b.d.-VP16 transcriptional activator in the presence (+) or absence (-) of estrogen, are shown in
FIG. 8. Compounds that exert nonspecific effects as described above will score positive when using 
this cell line. 
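
The counter-screen logic above can be written as a simple decision rule: a compound that lowers SEAP in the SH2-dependent line but also lowers it in the GAL4-estrogen receptor l.b.d.-VP16 control line is treated as nonspecific. This is a minimal sketch; the 50% cutoff and the label strings are assumptions made for illustration, not values given in the specification.

```python
def classify_compound(sh2_line_pct, control_line_pct, cutoff_pct=50.0):
    """Classify one compound from two SEAP readings.

    Both arguments are SEAP activity expressed as percent of the untreated,
    estrogen-induced signal in the respective cell line. cutoff_pct is a
    hypothetical threshold below which the signal counts as reduced.
    """
    if sh2_line_pct >= cutoff_pct:
        return "inactive"
    if control_line_pct < cutoff_pct:
        # SEAP also drops in the SH2-independent control line, consistent
        # with cytotoxicity or a general block of transcription, translation,
        # secretion, GAL4 DNA binding or estrogen receptor function.
        return "nonspecific"
    return "candidate SH2 inhibitor"


# Hypothetical readings for three compounds.
for name, sh2, ctrl in [("cmpd_A", 20.0, 95.0),
                        ("cmpd_B", 20.0, 30.0),
                        ("cmpd_C", 90.0, 95.0)]:
    print(name, classify_compound(sh2, ctrl))
```

Compounds classified as candidates would then go on to the binding characterization of Example 7.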

EXAMPLE 7: METHODS FOR CHARACTERIZING BINDING ACTIVITY OF AN INHIBITOR 
IDENTIFIED BY THE METHOD OF THIS INVENTION 

Several methods for characterizing binding activity (e.g., counter-screens) of an inhibitor 
identified by the present invention are discussed below, where the PBD is an SH2 domain of 
interest. 

1 . Competitive Binding Assays: 

Binding is measured by competition using surface plasmon resonance [Malmqvist, M., Curr.
Opin. Immunol. 5:282-286 (1993)] as implemented in the BIAcore® Biosensor (Pharmacia
Biosensor, Piscataway, NJ). SH2 proteins, e.g., pp60src, pp70ZAP, or pp72syk, are
pre-incubated with various concentrations of test compound and the ability of the test compound to
competitively inhibit binding to a phosphopeptide ligand is measured. Results are compared to binding
measured in the absence of competitor and expressed as percent inhibition.

IC50 values reflect the concentration of inhibitor required to reduce binding by 50%.
Specifics of individual assays are described below. All assays are run in HEPES Buffered Saline
(HBS) composed of 10 mM HEPES (pH 7.4) / 150 mM NaCl / 3.4 mM EDTA / 0.05% Surfactant
P20 at 25 °C and a flow rate of 5 µL min-1.
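
Percent inhibition and IC50, as defined above, can be computed from binding responses measured with and without competitor. The following is a minimal sketch that estimates the IC50 by linear interpolation on log concentration between the two points that bracket 50% inhibition. The interpolation method is an assumption chosen for brevity (the specification does not state how IC50 values were fitted), and the example numbers are hypothetical.

```python
import math


def percent_inhibition(response_with, response_without):
    """Percent inhibition relative to binding measured without competitor."""
    if response_without <= 0:
        raise ValueError("uninhibited response must be positive")
    return 100.0 * (1.0 - response_with / response_without)


def ic50_log_interp(concentrations, inhibitions):
    """Estimate the concentration giving 50% inhibition.

    concentrations must be positive and ascending; inhibitions are the
    matching percent-inhibition values. Returns None when no adjacent pair
    of points brackets 50%.
    """
    points = list(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi > i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None


# Hypothetical competitor concentrations (uM) and biosensor responses.
concs = [1.0, 10.0, 100.0]
responses = [90.0, 50.0, 10.0]   # with competitor
baseline = 100.0                 # without competitor
inhib = [percent_inhibition(r, baseline) for r in responses]
print(inhib, ic50_log_interp(concs, inhib))
```

A four-parameter logistic fit over the full dilution series would be the more usual choice where enough points are available.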

a. Specifics of Tandem Syk Assay 

A pp72syk peptide ligand corresponding to the g-chain ITAM of human FceRI
[DGVY(PO4)TGLSTRNQETY(PO4)ETLK (SEQ ID NO:17)] was synthesized as part of a larger peptide
[Ac-CGGDGVY(PO4)TGLSTRNQETY(PO4)ETLK-NH2 (SEQ ID NO:18)] and used to generate a
Syk-sensitive biosensor surface. Specifically, a Biosensor Chip CM5 was activated with 200 mM
ethyl-3-(3-dimethylaminopropyl)-carbodiimide hydrochloride (EDC)/50 mM
n-hydroxysuccinimide (NHS) to generate a surface reactive to primary amines; treated with
ethylenediamine to generate a surface rich in primary amines; activated with
m-maleimidobenzoyl-n-hydroxysuccinimide ester (sulfo-MBS; 50 mM in 25 mM NaHCO3) to
generate a surface reactive to free thiols; and the ITAM peptide immobilized through the
n-terminal cysteine. Unreacted sites were blocked with β-mercaptoethanol and the chip cleaned of
noncovalently bound peptide using 6 M guanidine hydrochloride. Assays were run in HBS using 20
nM pp72syk(1-265) +/- test inhibitor.

b. Specifics of C-Syk Assay 

A pp72syk peptide ligand corresponding to a hemiphosphorylated g-chain ITAM of human
FceRI [DGVY(PO4)TGLSTRNQETYETLK (SEQ ID NO:19)] was synthesized as part of a larger peptide
[Ac-CGGDGVY(PO4)TGLSTRNQETYETLK-NH2 (SEQ ID NO:20)] and used to generate a
C-Syk-sensitive biosensor surface as described above for tandem Syk. Assays were run in HBS
using 270 nM pp72syk(163-265) +/- test inhibitor.

c. Specifics of Tandem ZAP Assay 

A pp70ZAP peptide ligand corresponding to the ζ-chain ITAM-1 of the human T-cell receptor 
[NQLY(PO4)NELNIGRREEY(PO4)DVLD (SEQ ID NO:21)] was synthesized as part of a larger 
peptide [Ac-KGGNQLY(PO4)NELNIGRREEY(PO4)DVLD-NH2 (SEQ ID NO:22)] and used to generate a 
ZAP-sensitive biosensor surface. Specifically, a Biosensor Chip CM5 was activated with 200 mM 
EDC / 50 mM NHS to generate a surface reactive to primary amines, and the ITAM peptide was 
immobilized through the N-terminal lysine. Unreacted sites were blocked with ethanolamine (1 M 
in water) and the chip cleaned of noncovalently bound peptide using 6 M guanidine hydrochloride. 
Assays were run in HBS using 10 nM pp70ZAP(1-259) +/- test inhibitor. 

d. Specifics of Src Assay 

A pp60src peptide ligand corresponding to the hamster Middle-T antigen [QY(PO4)EEIPI (SEQ 
ID NO:23)] was synthesized as part of a larger peptide [Ac-KGGQY(PO4)EEIPI-NH2 (SEQ ID 
NO:24)] and used to generate a Src-sensitive biosensor surface as described above for tandem ZAP. 
Assays were run in HBS using 270 nM pp60src(144-251) +/- test inhibitor. 
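
The four assay configurations described in sections a through d can be collected into a small lookup structure, sketched below in Python. The class and function names are hypothetical, and the "pY" notation for phosphotyrosine is an assumption made for compactness (the text writes Y(PO4)). The values themselves are taken from the descriptions above.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class SH2Assay:
    name: str
    protein: str        # SH2 construct with residue range, as given in the text
    protein_nM: float   # protein concentration used in HBS
    core_ligand: str    # one-letter sequence; "pY" marks phosphotyrosine
    linker: str         # N-terminal residues added for immobilization

    def immobilized_peptide(self) -> str:
        """Larger peptide: N-acetyl + linker + core ligand + C-terminal amide."""
        return f"Ac-{self.linker}{self.core_ligand}-NH2"

ASSAYS = [
    SH2Assay("tandem Syk", "pp72syk(1-265)", 20, "DGVpYTGLSTRNQETpYETLK", "CGG"),
    SH2Assay("C-Syk", "pp72syk(163-265)", 270, "DGVpYTGLSTRNQETYETLK", "CGG"),
    SH2Assay("tandem ZAP", "pp70ZAP(1-259)", 10, "NQLpYNELNIGRREEpYDVLD", "KGG"),
    SH2Assay("Src", "pp60src(144-251)", 270, "QpYEEIPI", "KGG"),
]

def coupling_chemistry(assay: SH2Assay) -> str:
    """Thiol coupling for an N-terminal Cys, amine coupling for an N-terminal Lys."""
    if assay.linker.startswith("C"):
        return "thiol (sulfo-MBS)"
    return "amine (EDC/NHS)"
```

For example, `ASSAYS[3].immobilized_peptide()` gives the Src surface peptide and `coupling_chemistry(ASSAYS[0])` reports thiol coupling for the tandem Syk surface.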

EXAMPLE 8: YEAST-BASED TWO-HYBRID SCREENING ASSAYS 

The constitutive as well as inducible Src SH2- and ZAP SH2-dependent two-hybrid systems 
described here using mammalian cells are also useful with yeast cells. A yeast strain which 
contains an integrated reporter gene consisting of the β-galactosidase gene linked to a promoter 
that is GAL4-dependent has been demonstrated. Two-hybrid formation induces β-galactosidase 
production, which can be detected using an enzyme assay, or from the blue color produced when 
cells producing enzyme are exposed to X-GAL. As with mammalian cells, SH2-dependent 
two-hybrid formation in yeast can be rendered estrogen-dependent using the estrogen receptor 
ligand binding domain. 

A yeast strain such as one that contains a mutation in the srb1 gene is used for the 
two-hybrid assay. Cells containing the ts1 allele of srb1 exhibit increased endocytosis and have 
generally increased permeability. Cells are engineered so that plasmids used for the two-hybrid 
screen can be selectively propagated. 

a) HIS3-1,2,4-triazole Assay 

In this assay, cells auxotrophic for histidine are transformed with a HIS3 gene controlled by 
a promoter that contains LexA DNA binding sites. The tandem SH2 domains of ZAP are expressed in 
cells linked to the LexA DBD. This fusion protein binds to DNA, but is not a competent 
transcriptional activator. The TCR ζ ITAM sequence is also expressed in the cells, as part of a 
hybrid protein that includes a nuclear localization sequence, the Lck tyrosine kinase catalytic 
domain and a transcriptional activation domain. The Lck kinase domain should be active and in the 
nucleus, with the potential to phosphorylate the tyrosine residues of the ζ ITAM sequence either in cis 
or in trans. 

The interaction of the phosphorylated ζ ITAM-transcriptional activation domain fusion 
protein with the ZAP SH2 domains-LexA protein activates HIS3 transcription, generating cells that 
can grow in media lacking histidine. Compounds that interfere with the interaction of the ZAP SH2 
domains with phosphorylated ζ ITAM will reduce the level of HIS3 transcription. Compounds that 
efficiently block this interaction prevent yeast growth in media lacking histidine. However, 
compounds that weakly disrupt ZAP SH2-ζ ITAM association are not detected, as histidine production 
by the cell must be dramatically reduced to affect viability. 

The sensitivity of this assay can be adapted to detect molecules that weakly block 
ZAP SH2-ζ ITAM association. Cells are grown in the presence of 3-amino-1,2,4-triazole, a 
competitive inhibitor of the HIS3 gene product. Cells are cultured using a concentration of 
3-amino-1,2,4-triazole that is just below the amount required to affect cell viability. Compounds 
that trigger even a small reduction of HIS3 transcription are identified, as the affected cells will no 
longer grow. 

There are two counterscreens useful to evaluate whether compounds that score in the above 
assay exert their effect on growth by specifically affecting ZAP SH2-ζ ITAM association. In one 
counterscreen, the cells are cultured in the presence of histidine. ZAP SH2 inhibitors do not affect 
the growth of cells under this condition. In a second counterscreen, an identical yeast strain is 
established except that the transcription of HIS3 is driven by a monomer of the LexA DBD 
genetically fused to the activation domain; thus the transcription of the HIS3 gene is not dependent 
on ZAP SH2-ζ ITAM association. 

Specifically, the yeast host cells in log phase growth are seeded in 96 well dishes (0.1 ml of 
cells) in media that lacks histidine and contains 3-amino-1,2,4-triazole. Serial dilutions of 
compound are added to wells. The cultures are incubated 1-2 days at 30°C on a shaker platform. 
The effect of compounds on growth is assessed using a 96 well plate reader. 

This assay depends on growth inhibition (the interference of two-hybrid dependent HIS3 
transcription) as the readout. Secondary assays should be performed to confirm that the 
compounds that scored positive are not toxic to yeast. 
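
The primary screen and the two counterscreens described above could be combined along the following lines. This Python sketch is illustrative only: the function names, the 0.5 growth threshold, and the use of optical density relative to an untreated well are assumptions that are not specified in the text.

```python
def relative_growth(od_treated, od_untreated, od_blank=0.0):
    """Growth in a treated well as a fraction of the untreated control well."""
    return (od_treated - od_blank) / (od_untreated - od_blank)

def classify(minus_his, plus_his, lexa_ad_control, threshold=0.5):
    """Classify one compound from its relative growth in three conditions.

    minus_his:       two-hybrid strain, no histidine, with 3-amino-1,2,4-triazole
    plus_his:        same strain with histidine added (first counterscreen)
    lexa_ad_control: strain whose HIS3 is driven by the LexA DBD fused directly
                     to the activation domain (second counterscreen)
    threshold:       hypothetical cutoff below which growth counts as inhibited
    """
    if minus_his >= threshold:
        return "inactive"
    if plus_his < threshold or lexa_ad_control < threshold:
        return "nonspecific or toxic"
    return "candidate inhibitor"
```

A compound is kept as a candidate only when it suppresses growth in the primary screen and leaves both counterscreen cultures growing, which mirrors the logic of the two counterscreens.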

b) URA3/5-FOA Assay 

The second approach for identifying molecules that inhibit ZAP SH2 binding in yeast scores 
inhibitory compounds based on the ability to block ZAP two-hybrid dependent cell death. In this 
assay, the tandem ZAP SH2 domain is expressed in yeast fused to the LexA DBD and the ζ ITAM-Lck 
domain is fused to a transcriptional activation domain, as in the constructions described above. However, in 
this assay the reporter gene regulated by the LexA DBD is the URA3 gene. 

Cells which express the URA3 gene product die when exposed to 5-fluoro-orotic acid 
(5-FOA). The product of the URA3 gene converts 5-FOA to fluorodeoxyuridine, which is a potent 
inhibitor of thymidylate synthetase. Hence 5-FOA is conditionally toxic to cells. Compounds that 
interfere with tandem ZAP SH2-ζ ITAM complex formation reduce the amount of URA3 gene 
product. By carefully titrating the amount of "two-hybrid" transcriptional activator in the cell 
and by using the lowest concentration of 5-FOA that will still kill cells, conditions are defined 
where compounds that disrupt the ZAP SH2-ζ ITAM interaction are identified. Such molecules 
reduce the URA3 gene product levels (and hence the fluorodeoxyuridine levels) below the level 
where toxic amounts of fluorodeoxyuridine are formed. 

In contrast to the assay of 8A above, toxic compounds are not scored when using the 
"URA3/5-FOA" screen, which is a cell viability assay. 
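
Because the two yeast assays read out in opposite directions (growth inhibition in the HIS3 assay, growth rescue in the URA3/5-FOA assay), their results can be cross-checked. The sketch below shows one hypothetical way to combine them for a single compound; the function name and the wording of the categories are assumptions made for illustration.

```python
def interpret(his3_growth_inhibited: bool, ura3_growth_rescued: bool) -> str:
    """Combine the HIS3 and URA3/5-FOA readouts for a single compound."""
    if his3_growth_inhibited and ura3_growth_rescued:
        # Loss of growth without histidine plus survival on 5-FOA both point
        # to reduced two-hybrid dependent transcription.
        return "consistent with a specific inhibitor"
    if his3_growth_inhibited:
        # Growth inhibition alone cannot be distinguished from toxicity.
        return "possibly toxic to yeast"
    if ura3_growth_rescued:
        return "scored only in the URA3/5-FOA screen"
    return "inactive in both screens"
```

A generally toxic compound cannot rescue cells from 5-FOA, so it falls into the second category instead of being scored as an inhibitor.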

EXAMPLE 9: ASSOCIATION OF SRC PROTEIN WITH TYROSINE PHOSPHORYLATED PROTEINS IN VIVO 

Several proteins have been reported to coprecipitate with v-Src from extracts of 
v-Src-transformed cells [Kanner, S.B. et al., EMBO J. 10:1689-1698 (1991); Koch, C.A., Mol. 
Cell. Biol. 12:1366-1374 (1992)]. The ability of a high affinity Src SH2-binding peptide 
[EPQpYEEIPI, denoted here as YEEI (SEQ ID NO:25)] to block SH2 domain-dependent interactions is 
demonstrated in this assay. 

In this assay, Balb/c 3T3 and SRD-3T3 cells (v-Src transformed 3T3 cells) were lysed in 
RIPA buffer and the Src protein was immunoprecipitated with a monoclonal antibody to the 
amino-terminus of Src in the presence or absence of the SH2 and SH3 binding peptides. The 
samples were transferred to nitrocellulose, and the blot was probed with a monoclonal antibody to 
phosphotyrosine. Two major tyrosine phosphorylated proteins co-migrated with v-Src from SRD 
3T3 cells, one of which co-migrated with the 62 kDa tyrosine-phosphorylated protein that binds 
to the Src SH3 domain, and the other a protein of Mr 130,000, which is most likely the 130 
kDa protein previously detected in v-Src immunoprecipitates [Koch, C.A., Mol. Cell. Biol. 
12:1366-1374 (1992)]. 

The binding of Src to the 62 kDa protein was reduced significantly by incubation of the 
extracts with the "Src-pro" peptide or the phosphorylated YEEI [SEQ ID NO:25] peptide, but by neither the 
poly-proline nor the unphosphorylated YEEI [SEQ ID NO:25] peptides, suggesting that both the SH2 
and SH3 domains of Src are required for stable association of this protein with Src. This protein 
has not been determined to be the p62 protein identified in the total cell SH3 binding proteins, since 
the p62 antibody is not sensitive enough to recognize the small amounts of the 62 kDa protein that 
associate with Src under these conditions. The "Src-pro" peptide and the pYEEI [SEQ ID NO:25] 
peptide could only block 50% of p130 binding to Src, whereas 90% inhibition of p130 binding was 
observed when both peptides were used, suggesting that either the SH2 or the SH3 domain is 
sufficient for some degree of p130 association with Src. 

This invention is not to be limited in scope by the specific embodiments described herein. 
Indeed, various modifications of the invention in addition to those described herein will be 
apparent to those skilled in the art from the foregoing description. Such modifications are intended 
to fall within the scope of the appended claims. 

The disclosures of the patents, patent applications and publications cited herein are 
incorporated by reference in their entireties. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: ARIAD PHARMACEUTICALS, INC. 
RICKLES, Richard J. 

(ii) TITLE OF INVENTION: CELL-BASED ASSAY 

(iii) NUMBER OF SEQUENCES: 25 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ARIAD Pharmaceuticals, Inc. 

(B) STREET: 26 Landsdowne Street 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02139-4234 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/012,218 
(B) FILING DATE: 23-FEB-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: BERSTEIN, David L. 

(B) REGISTRATION NUMBER: 31,235 

(C) REFERENCE/DOCKET NUMBER: ARIAD 336-PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-494-0400 

(B) TELEFAX: 617-494-0208 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

CGGAATTCTC CAAGCCCCAG ACCCA 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GCGGATCCCT CAGCGACCTC CAACA 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CGGAATTCTC TGCTGGAGAC ATGAGAGCT 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

CGGAATTCGA CTGTGGCAGG GAAACCCT 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
GCGGATCCGG AGCTGGGGAA GAACTCA 27 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

ACGCGTCGAC TTATAAATCA ATGGGAGGAG 30 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
CCTCACGAAT TCGGTGGAGT GACCACCTTT GTGGCC 36 
(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

^ACTCGGAT CCGCCGGGGC AGACGGTGGT GAGGC 35 
(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

rrCACGAAT TCGGCGACTC CATCCAGGCT GAGGAG 36 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

^\CTCGGAT CCGCCGGGGC ACACGGTGGT GAGGC 35 
(2) INFORMATION FOR SEQ ID NO:11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 

rrCAOGAAT TCGGTGGAGT GACCACCTTT GTGGCC 36 
(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
CCAXJTCGt&TCCGCCGGGtt 35 
(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CCTCACGAAT TCGGCGACTC CATCCAGGCT GAGGAG 36 
(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
CGACTOGGATCCGCOGGGGCACAOGGTGCTG^^^ 35 
(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
aGGATCCAG CCTGGQGGAC GAGCTC 26 
(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

GACTAGTCC CACCGTACTC GTCAAT 26 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 15 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu Thr Tyr Glu 
1 5 10 15 

Thr Leu Lys 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "N-acetylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 18 

(D) OTHER INFORMATION: /note= "phosphorylated" 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "amidated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Cys Gly Gly Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu 
1 5 10 15 

Thr Tyr Glu Thr Leu Lys 
20 

(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu Thr Tyr Glu 
1 5 10 15 

Thr Leu Lys 



(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "acetylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "amidated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Cys Gly Gly Asp Gly Val Tyr Thr Gly Leu Ser Thr Arg Asn Gln Glu 
1 5 10 15 

Thr Tyr Glu Thr Leu Lys 
20 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 15 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Asn Gln Leu Tyr Asn Glu Leu Asn Ile Gly Arg Arg Glu Glu Tyr Asp 
1 5 10 15 

Val Leu Asp 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "N-acetylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 7 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 18 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 22 

(D) OTHER INFORMATION: /note= "amidated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Lys Gly Gly Asn Gln Leu Tyr Asn Glu Leu Asn Ile Gly Arg Arg Glu 
1 5 10 15 

Glu Tyr Asp Val Leu Asp 
20 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Gln Tyr Glu Glu Ile Pro Ile 
1 5 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= "N-acetylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "phosphorylated" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 
(B) LOCATION: 10 

(D) OTHER INFORMATION: /note= "amidated" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Lys Gly Gly Gln Tyr Glu Glu Ile Pro Ile 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Tyr Glu Glu Ile 
1 
CLAIMS 



1. A cell containing and capable of expressing recombinant DNA molecules encoding two fusion 
proteins, 

(a) a first fusion protein comprising one or more copies of each of a transcription 
activation domain and one member of a phosphopeptide binding pair, 

(b) a second fusion protein comprising one or more copies of each of a DNA-binding domain 
and the other member of the phosphopeptide binding pair, 

wherein the phosphopeptide binding pair comprises (i) a peptide ligand containing one or 
more tyrosine residues and (ii) a phosphopeptide binding domain capable of binding to the peptide 
ligand upon phosphorylation of one or more of the peptide ligand's tyrosine residues, and 

wherein the cell further contains a reporter gene which is linked to a DNA sequence to which 
said second fusion protein is capable of binding and which reporter gene encodes a detectable gene 
product which is expressed when said transgene is activated by proximity to a transcriptional 
activation domain. 

2. The cell according to claim 1 wherein said phosphopeptide binding pair comprises an SH2 
domain and a ligand therefor, a phosphatase domain and a ligand therefor, or a PTB phosphopeptide 
binding domain and a ligand therefor. 

3. The cell according to claim 1 wherein the recombinant DNA molecules encode 

(a) a first fusion protein which comprises one or more copies of a transcription activation 
domain and a phosphopeptide binding domain, and 

(b) a second fusion protein which comprises one or more copies of a DNA-binding domain 
and the peptide ligand capable, upon phosphorylation, of binding to the phosphopeptide binding 
domain. 

4. The cell according to claim 3 wherein the second fusion protein further comprises a kinase 
domain. 

5. The cell according to claim 3 wherein the first fusion protein contains two phosphopeptide 
binding domains. 

6. The cell according to claim 1 wherein one of the fusion proteins further contains a hormone 
binding domain permitting regulated sequestering of the fusion protein. 

7. The cell according to claim 1, which is mammalian. 

8. The cell according to claim 1, further comprising an endogenous kinase domain. 

9. A method for identifying the presence in a test composition of an inhibitor of the binding of a 
phosphopeptide binding pair which comprises the steps of: 

(a) providing cells of claim 4 or 8, which when cultured are capable of producing 
measurable amounts of said detectable gene product, 

(b) culturing said cells in the presence and absence of the test composition, and 

(c) determining whether production of the detectable gene product was diminished in the 
presence of the test composition. 

10. The method according to claim 9 wherein said cell is a mammalian cell. 

11. The method according to claim 9, wherein the DNA-binding domain and transcriptional 
activation domain are derived from transcriptional activators having separable DNA-binding and 
transcriptional activation domains. 

12. The method according to claim 11, wherein the DNA binding domain and the transcriptional 
activation domain are selected from the group consisting of transcriptional activators GAL4, GCN4 
and ADR1. 

13. The method according to claim 9, wherein the DNA-binding domain and the transcriptional 
activation domain are from different transcriptional activators. 

14. The method according to claim 9 wherein said phosphopeptide binding pair comprises an SH2 
domain and a ligand therefor, a phosphatase domain and a ligand therefor, or a PID phosphopeptide 
binding domain and a ligand therefor. 

15. The method according to claim 9 wherein the first fusion protein contains two phosphopeptide 
binding domains. 

16. The method according to claim 9 wherein one of the fusion proteins further contains a 
hormone binding domain permitting regulated sequestering of the fusion protein. 

17. The method according to claim 9 wherein said test composition is selected from a mixture of 
one or more test peptides, wherein said mixture is provided in the form of a library of synthetic 
peptides or in the form of a phage library displaying the various peptides. 

18. The method according to claim 9 wherein said polypeptide ligand is an immunoreceptor 
tyrosine activation motif. 

19. The method according to claim 18 wherein said ITAM is the TCR zeta ITAM. 

20. An inhibitor identified by the method of claim 9. 